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Patent 18810-81552 
GENETIC MARKER FOR SPONDYLOEPIMETAPHYEAL DYSPLASIA 

The U.S. Government has a paid-up license in this invention and the right 
in limited circumstances to require the patent owner to license others on reasonable terms 
as provided for by the terms of grants HD22657 and AR02038 awarded by the NIH. 

5 BACKGROUND OF THE INVENTION 

Throughout the application various publications are referenced in 
parentheses. The disclosures of these publications in their entireties are hereby 
incorporated by reference in the application in order to more fully describe the state of the 
art to which this invention pertains. 



10 1 . The Field Of The Invention 

This invention relates to the medical arts. In particular, it relates to a 
genetic marker that is useful for diagnosing or treating spondyloepimetaphyseal dysplasia 
and for identifying genetic carriers of heritable alleles associated with 
spondyloepimetaphyseal dysplasia. 

1 5 2. Discussion Of The Related Art 

Osteochondrodysplasias are a genetically heterogeneous group of 
disorders related to cartilage producing cells. Abnormalities in cartilage formation can 
cause defects in bone deposition, skeletal development, linear growth, and the continued 
maintenance of cartilage and bone. (Reviewed in Mundlos, S. and Olsen, B. R., Heritable 
20 diseases of the skeleton. Part II: Molecular insights into skeletal development-matrix 
components and their homeostasis,, FASEB J. 1 1(4):227-33 [1997]). 

There are numerous and disparate causes of osteochondrodysplasias. 
(Horton, W.A., Molecular genetic basis of the human chondrodysplasias, Endocrinol. 
Metab. Clin. North Am. 25(3):683-97 [1996]). A significant number are due to defects in 
25 the collagen genes themselves. (Reviewed in Williams, C.J. and Jiminez, S.A., Heritable 
diseases of cartilage caused by mutations in collagen genes, J. Rheumatol. Suppl. 43:28- 
33 [1995]; Byers, P.H., Molecular genetics of chondrodysplasias, including clues to 
development, structure, and function, Curr. Opin. Rheumatol. 6(3):345-50 [1994] ). In 



Patent 



18810-81552 



addition to collagen defects, several forms of osteochondrodysplasias are caused by 
mutations in the cartilage oligomeric matrix protein (COMP). (Ikegawa, S., et aL, Novel 
and recurrent COMP mutations in preudoachondroplasia and epiphyseal dysplasia, 
Hum. Genet. 103(6):633-8 [1998]; Briggs ? M.D., et aL, Diverse mutations in the gene for 
5 cartilage oligomeric matrix protein in the pseudoachondroplasia-multiple epiphyseal 
dysplasia disease spectrum, Am. J. Hum. Genet. 62(2):31 1-9 [1998]; Briggs, MX)., et aL, 
Pseudoachondroplasia and multiple epiphyseal dysplasia due to mutations in the 
cartilage matrix protein gene, Nat. Genet. 10(3):330-6 [1995]; Ballo, R., et aL, Multiple 
epiphyseal dysplasia, ribbing type: a novel point mutation in the COMP gene in a South 
10 African family, Am. J. Med. Genet. 68(4):396-400 [1997]). 

Still other osteochondrodysplasias have been found to be caused by defects 

W 

; v g in secreted peptide growth factors and their receptors, (E.g., Thomas, J.T., et aL, A 

r:d 

: s g human chondroplysplasia due to a mutation in a TGF-fl superfamily member, Nat. Genet. 

^ 12(3):315-7 [1996]; Bonaventure, J., et ah, Common mutations in the gene encoding 

CO 15 FGFR-3 account for achondroplasia, hypochondroplasia and thanatophoric dysplasia, 
^ Acta Paediatr. Suppl. 417:33-38 [1996]). 

g Finally, mutations in genes which affect protein sulfation cause some 

forms of osteochondrodysplasia. Protein sulfation is a post-translational modification 
!«*■ carried out by all cells, (Lipmann, F., Biological sulfate activation and transfer, Science 

20 128:575-580 [1958]). The primary source of sulfur for the sulfation pathway is free 
sulfate, which can be transported into the cytoplasm by one of a variety of transmembrane 
symporter or antiporter molecules. (Elgavish, A., et ah, Sulfate transport in human lung 
fibroblasts (IMR-90), J. Cell Physiol. 125:243-250 [1985]; Markovich, D., et aL, 
Expression cloning of rat renal Na+ZSO/ 2 ^ cotransport, Proc. Natl. Acad. Sci. U.S.A. 

25 90:8073-8077 [1993]; Bissig, M, 3 et aL, Functional expression cloning of the canalicular 
sulfate transport system of rat hepatocytes, J. Biol.Chem. 269:3017-3021 [1994]; 
Hastbacka, J., et aL , The diastrophic dysplasia gene encodes a novel sulfate transporter: 
positional cloning by fine-structure linkage disequilibrium mapping, Cell 78:1073-1087 
[1994]; Everett, L.A., et aL, Pendred syndrome is caused by mutations in a putative 

30 sulphate transporter gene (PDS), Nat. Genet 17:41 1-422 [1997]). 
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Within the cytoplasm, sulfate is activated to a high energy form in two 
enzymatic steps (Geller, D.H., et al 9 Co-purification and characterization of ATP- 
sulfurylase and adenosine- 5'-phosphosulf ate kinase from rat chondrosarcoma, J. Biol. 
Chem, 262:7374-7382 [1987]). First, utilizing ATP and sulfate as substrates, an ATP 
sulfurylase activity catalyzes the synthesis of adenosine 5'-phosphosulfate (APS). 
Subsequently, an APS kinase activity catalyzes the phosphorylation of the APS to 
generate 3'-phosphoadenosine 5'-phosphosulfate (PAPS). PAPS is the universal 
bioactivated sulfate donor used in all known post-translational sulfation reactions. 

Secreted extracellular matrix proteins are post-translationally sulfated in 
the Golgi, and delivery of PAPS to the Golgi is mediated by a PAPS translocase activity. 
Microsomal proteins with PAPS binding activity have been identified (Mandon, E.C., et 
al 9 Purification of the golgi adenosine 3 -phosphate 5 f -phosphosulfate transporter, 
homodimer within the membrane, Proc.Natl.Acad.Sci.U.S.A. 91:10707-10711 [1994]; 
Ozeran, J.D., et al, Kinetics of PAPS translocase: evidence for an antiport mechanism^ 
Biochemistry 35:3685-3694 [1996]), but the tissue specificity and the contribution of 
these proteins to PAPS transport remains unknown. Following transport, sulfation 
reactions are carried out by substrate-specific sulfotransferases. A major class of 
sulfation substrates within the Golgi is the side-chains of proteoglycans, which are 
abundant structural proteins of the extracellular matrices of many tissues. Proteoglycans 
are particularly abundant in the extracellular matrix of cartilage. 

Direct evidence that sulfation of extracellular matrix proteins is essential 
for proper matrix function was revealed by the identification of mutations in the 
diastrophic dysplasia sulfate transporter, DTDST. (Hastbacka et al [1994]). Mutations in 
the DTDST gene produce a spectrum of recessively inherited osteochondrodysplasia 
phenotypes. (Hastbacka et al [1994]; Hastbacka, J., et al 9 Atelosteogenesis type II is 
caused by mutations in the diastrophic dyplasia sulfate transporter gene (DTDST): 
Evidence for a phenotypic series involving three chondrodysplasias. Am. J, Hum. Genet, 
58:255-262 [1996]; Superti-Furga, et al., A family of chondrodysplasias caused by 
mutations in the diastrophic dysplasia transporter gene and associated with impaired 
sulfation of proteoglycans, Ann. N. Y. Acad. Sci. 785:195-201 [1996]). The severity of 

3 
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the three known disorders, i.e., the moderately severe diastrophic dysplasia phenotype 
and the lethal forms, atelosteogenesis type II and achondrogenesis type IB, is correlated 
with the consequences of the mutations on the activity of the transporter. (Superti-Furga, 
A, et ah Achondrogenesis type IB is caused by mutations in the diastrophic dysplasia 
sulfate transporter gene, Nature Genet. 12:100-02 (1996). The mutations lead to 
dramatically reduced proteoglycan sulfation in cartilage, particularly the chondroitin 
sulfate side chains of aggrecan. However, even in the most severe disorder in the group, 
some proteoglycan sulfation can be measured. (Rossi, A., et al, Undersulfation of 
proteoglycans synthesized by chondrocytes from patient with achondrogenesis type IB 
homozygous for an L483P substitution in the diastrophic dysplasia sulfate transporter, J. 
Biol. Chem. 271:18456-64 [1996]; Rossi, A., et al, Undersulfation of cartilage 
proteoglycans ex vivo and increased contribution of amino acid sulfur to sulfation in vitro 
in McAlister dysplasia/atelosteogenesis type 2, Eur. J. Biochem. 248:741-47 [1997]; 
Rossi, A., et al., Proteoglycan sulfation in cartilage and cell cultures from patients with 
sulfate transporter chondrodyplasias: relationship to clinical severity and indications on 
the role of cellular sulfate production, Matrix Biology 17:361-69[1998]). 

Biochemical evidence that a defect in another step in the sulfation pathway 
can produce an osteochondrodysplasia phenotype was provided by studies in the 
brachymorphic (bm) mouse. The brachymorphic phenotype is characterized by 
disproportionate short-limb dwarfism, a short spine and tail, and a domed skull. (Lane & 
Dickie [1968]). The brachymorphic mouse also exhibits an increased bleeding time, but 
tests of platelet function, including aggregation and secretion, have so far failed to reveal 
specific functional deficits in brachymorphic platelets. (Rusiniak, M.E., et ah, Molecular 
markers near the mouse brachymorphic (bm) gene, which affects connective tissues and 
bleeding time, Mamm. Genome 7:98-102 [1996]) 

In brachymorphic mice, abnormal growth plates with a structurally 

abnormal cartilage extracellular matrix and short chondrocyte columns, with 

comparatively unaligned cells, are apparent on histologic analysis. (Lane & Dickie 

[1968]; Orkin et al [1976]; Orkin et a/.[1977]; Miller, W.A. & Flynn-Miller, K.L., 

Achondroplastic, brachymorphic and stubby chondrodystophies in mice, J. Comp. Pathol. 

4 
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86:349-63 [1976]), Cartilage from this recessively inherited mutant phenotype shows 
small, diffuse proteoglycan granules and reduced staining for sulfated 
glycosaminoglycans, consistent with a defect affecting sulfation of the proteoglycans of 
the cartilage extracellular matrix. (Lane, P. & Dickie, M.M., Three recessive mutations 
5 producing diproportionate dwarfing in mice: achondroplasia, brachymorphic, and 
stubby, J. Hered. 59:300-08 [1968]). In brachymorphic mice, proteoglycan granules show 
a 50% reduction in size in the reserve zone of the growth plate matrix, and are difficult to 
identify in the proliferative and hypertrophic zones. (Orkin, R.W., et al, Undersulfated 
chondroitin sulfate in the cartilage matrix of brachymorphic mice, Dev. Biol. 50:82-94 
10 [1976]; Orkin, R.W., et a!., Defects in the cartilaginous growth plates of brachymorphic 
mice, J. Cell Biol. 73:287-99 [1977]). This suggests that reduced proteoglycan sulfation 
affects the signals that regulate growth plate chondrocyte maturation. Heparan sulfate 
i proteoglycans have been implicated in the sequestration and presentation of growth 

| factors, particularly fibroblast growth factors, to receptors at the cell surface. (Rapraeger, 

g 15 A.C., et al, Requirement of heparan sulfate for bFGF-mediated fibroblast growth and 
'!! myoblast differentiation, Science 252:1705-08 [1991]; Yayon, A., et al., Cell surface, 

Jl heparin-like molecules are required for binding of basic Fibroblast Growth Factor to its 

Q high-affinity receptor, Cell 64:841-48 [1991]; Schlessinger, J., et ah, Regulation of 

..^ growth factor activation by proteoglycans: What is the role of the low affinity receptors?, 

Hi 20 Cell 83:357-60 [1995]). Skeletal defects in mice lacking heparan sulfate 2- 
Q. sulfotransferase include dwarfism, with shortened long bones, ribs and spine, implying a 

specific role for heparan sulfate proteoglycans in skeletal development. (Bullock, S.L, et 
ah, Renal agenesis in mice homozygous for a gene trap mutation in the gene encoding 
heparan sulfate 2-sulfotransferase, Genes Dev. 12:1894-1906 [1998]). 

25 Indeed, reduced activities of both ATP sulfurylase and APS kinase, and 

decreased synthesis of chondroitin sulfate, have been demonstrated in bm mice. 
(Schwartz, N.B., et ah, Defective PAPS-synthesis in epiphyseal cartilage from 
brachymorphic mice, Biochem. Biophys. Res. Commun. 82:173-78 [1978]; Sugahara, K. 
& Schwartz, N.B., Defect in 3'-phosphoadenosine 5-phosphosulfate formation in 

30 brachymorphic mice. Proa Natl. Acad. Set U.S.A. 76:6615-18 [1979]). The ATP 
sulfurylase and APS kinase activities in brachymorphic mouse tissues co-purified. 
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(Geller, D.H., et al [1987]; Lyle, S., et al, Rat chondrosarcoma ATP sulfurylase and 
adenosine 5-phosphosulfate kinase reside on a single bifunctional protein, Biochemistry 
33:5920-25 [1994]). The functional activity of the APS kinase was reduced to a greater 
extent than was that of the ATP sulfurylase, suggesting that brachymorphism resulted 
5 from a structural mutation that affected channeling APS from the carboxyl-terminal 
sulfurylase activity to the amino-terminal kinase. (Lyle, S., et al, Sulfate-activating 
enzymes in normal and brachymorphic mice: evidence for a channeling defect, 
Biochemistry 34:940-45 [1995]). 

A cDNA encoding a murine bifunctional ATP-sulforylase/APS-kinase 
10 enzyme (known as Papssl, formerly known as Atpskl) was isolated from brachymorphic 
mice and studied by mutational analysis. (Li, H. et al, The isolation and characterization 
of cDNA encoding the mouse bifunctional ATP sulfurylase-adenosine 5 f -phosphosulfate 
kinase, J. Biol. Chem. 270(49):29453-59 [1995]; Deyrup, A.T. et al, Deletion and site- 
directed mutagenesis of the ATP-binding motif (P-loop) in the bifunctional murine ATP- 
15 sulfurylase/adenosine S'-phosphosulfate kinase enzyme, J. Biol. Chem. 273(16):9450-56 
[1998]). Subsequently, bifunctional enzymes having ATP-sulfurylase (E.C. 
2.7.7.4)/APS-kinase (E.C. 2.7 A 25) activities were designated "PAPS synthetase" 
^ (PAPSS; also known as ASAPK and ATPSK). 



IS 

'hub? 



The bm mutation was placed on the mouse phenotypic map by Lane and 
20 Dickie (1968). Subsequent application of microsatellite markers to mice from a large 
backcross localized the bm mutation to a 2.5 cM region located at approximately 32 cM 
on mouse chromosome 19. (O'Brien, E.P., et al, Molecular map of chromosome 19 
including three genes affecting bleeding time: ep, ru and bm, Mamm. Genome 5:356-60 
[1994]; Rusiniak, M.E., et al [1996]). In addition, a liver cancer susceptibility gene, 
25 Hcs6, has been localized near bm in mouse strain C3H/He. (Manenti, G., et al [1994]). 

The bm mutation co-localized with a PAPSS gene. (Kurima, K. et al, A 
member of a family of sulfate-activating enzymes causes murine brachymorphism, Proc. 
Natl. Acad. Sci. 95(15):8681-85 [1998]; Sugahara, K. and Schwartz, KB., Defect in 3'- 
phosphoadenosine 5 f -phosphosulfate formation in brachymorphic mice, Proc. Nat. Acad. 

6 
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Sci. 76(12):6615-18 [1979]). Although brachymorphic mouse liver shows decreased 
PAPSS activity, the brachymorphic phenotype does not manifest any grossly 
recognizable liver defects. It has been observed that brachymorphic mouse liver has a 
decreased ability to esterify xenobiotic carcinogens with sulfate (Lyman, S.D. & Poland, 
A., Effect of the brachymorphic trait in mice on xenobiotic sulfate ester formation, 
Biochem, Pharmacol. 32:3345-50 [1983]), which implies that altered susceptibility to 
such agents could result from defects in a PAPSS gene. Consistent with this hypothesis, 
brachymorphic mice have decreased susceptibility to carcinogen-induced 
hepatocarcinoma, presumably due to a decreased ability to generate sulfated carcinogens. 
(Boberg, E.W., et al, Strong evidence from studies with brachymorphic mice and 
pentachlorophenol that l f -sulfooxysafrole is the major ultimate electrophilic and 
carcinogenic metabolite of l f -hydroxysafrole in mouse liver, Cancer Res. 43:5163-73 
[1983]; Lai, C.C., et al, Initiation of hepatocarcinogenesis in infant male B6C3F1 mice 
by N-hydroxy-2-aminofluorene depends primarily on metabolism to N-sulfooxy-2- 
aminofluorene and formation of DNA-(deoxyguanosin-8-yl)-2-aminofluorene adducts, 
Carcinogenesis 8:471-78 [1987]). 

PAPSS enzymes have been isolated and characterized in non-murine 

systems, for example, in association with rat chondrosarcomas. (Rosenthal, E. and 

Leustek, T., A multifunctional Urechis caupo protein, PAPS synthetase, has both ATP 

sulfurylase and APS kinase activities, Gene 165(2):243-48 [1995]; Schwartz, N.B., et al., 

Sulfate activation and transport in mammals; system components and mechanisms,, 

Chem. Biol Interact. 109(1-3):143-51 [1998]; Lyle, S. et al, Rat chondrosarcoma ATP 

sulfurylase and adenosine 5 '-phosphosulfate kinase reside on a single bifunctional 

protein, Biochemistry 33(19):5920-25 [1994]; Lyle, S. et ah, Intermediate channeling 

between ATP sulfurylase and adenosine 5 f -phosphosulfate kinase from rat 

chondrosarcoma, Biochemistry 33(22):6822-27 [1994]). Human cDNAs for PAPSS 

have also been cloned, and their activities have been analyzed biochemically and 

mutationally. (Yanagisawa, K. et al, cDNA cloning, expression, and characterization of 

the human bifunctional ATP sulfurylase/ adenosine 5-phosphosulfate kinase enzyme, 

Biosci. Biotechnol. Biochem. 62(5):1037-40 [1998]; Girard, J.P., et al, Sulfation in high 

endothelial venules: cloning and expression of the human PAPS synthetase, FASEB J. 

7 
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12(7):603-12 [1998]; Venkatachalam, K.V. et al, Molecular cloning, expression, and 
characterization of human bifunctional 3-phosphoadenosine 5 '-phosphosulf ate synthase 
and Us functional domains, J. Biol Chem. 273(30):19311-20 [1998]; Ventkatachalam, 
K.V., et ah, Site-selected mutagenesis of a conserved nucleotide binding HXGH motif 
5 located in the ATP sulfurylase domain of human bifunctional 3 f -phosphoadenosine 5- 
phosphosulfate synthase, L BioL Chem. 274(5):2601-04 [1999]). 



Phenotypically analogous to murine brachymorphism, is 
spondyloepimetaphyseal dysplasia (SEMD) in humans. SEMD is a subgroup of 
osteochondrodysplasias affecting skeletal development, linear bone and cartilage growth, 
10 and bone and cartilage maintenance. Effects of SEMD can include dwarfism, stunted or 
malformed limbs, enlarged joints, kyphoscoliosis (spinal warping), and brachydactyly 
(short fingers and toes). SEMD typically runs in families, and can be inherited in 
^ autosomal dominant or recessive manners. (Ahmad, M. et al, Distinct, autosomal 

'sip? 

CO recessive form of spondyloepimetaphyseal dysplasia segregating in an inbred Pakistani 

J 15 kindred, Am. J. Med. Genet. 78(5):468-73 [1998]; Figuera, L.E., et al, 
!aH Spondyloepimetaphyseal dysplasia (SEMD) Shohat type, Am. J. Med. Genet. 51(3);213- 

Q 15 [1994]; Shohat, M. et ah, New form of spondyloepimetaphyseal dysplasia (SEMD) in 

q Jewish family of Iraqi origin, Am. J. Med. Genet. 46(4):358-62, [1993]; Whyte, M.P. et 

; b 2 cd<-> Hypotrichosis with spondyloepimetaphyseal dysplasia in three generations; a new 

H 20 autosomal dominant syndrome, Am. J. Med. Genet. 36(3):288-91 [1990]; Gertner, J.M., 
et ah, Linkage studies of a Missouri kindred with autosomal dominant 
spondyloepimetaphyseal dysplasia (SEMD) indicate genetic heterogeneity, J. Bone 
Miner. Res. 12(8):1204-9, [1997]). 



A nonsense mutation in a novel, cartilage-specific human PAPSS 

25 (PAPSS2, formerly ATPSK2), has been indicated as the cause of a recessive form of 

SEMD in an inbred Pakistani family. (Ahmad, M., et al, A distinct, autosomal recessive 

form of spondyloepimetaphyseal dysplasia segregating in an inbred Pakistani kindred, 

Am. J. Med. Genet. 78:468-73 [1998]; ul Haque, M.F., et al, Mutations in orthologous 

genes in human spondyloepimetaphyseal dysplasia and the brachymorphic mouse, Nat. 

30 Genet. 20(2): 157-62 [1998]). Genome-wide linkage studies localized the disease gene for 

8 
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this dwarfing condition to chromosome 10q23-24, in a region syntenic with the locus for 
the bm locus on mouse chromosome 19. (ul Haque, M.F., et al [1998]). This disorder, 
designated SEMD Pakistani type, is characterized by short, bowed lower limbs, enlarged 
knee joints, kyphoscoliosis, a mild generalized brachydactyly, and early-onset 
degenerative joint disease in the hands and knees. Radiographs of patients with SEMD 
Pakistani type show delayed epiphyseal ossification, especially at the hips and knees, and 
platyspondyly. 

Currently, there are only a few methods of detecting bone related diseases. 
(Eg., Klock, J.C., Chondroitin sulfate as a marker for bone resorption, U.S. Patent No. 
5,869,273; Takeshita, S. et al, Bone-related sulfatase-like protein and process for its 
production, U.S. Patent No. 5,627,050). DNA-based diagnostic approaches have been 
suggested for some type 2 collagen disorders, such as Stickler syndrome, 
spondyloepiphyseal dysplasia, and achondrogenesis; achondroplasia (a defect in the 
fibroblast growth factor receptor 3 (FGFR3) gene); the collagen oligomeric matrix protein 
(COMP) disorders pseudoachondroplasia and multiple epiphyseal dysplasia, and others. 
(E.g., Ritvaniemi, Arthritis and Rheumatism 38:999-1004 [1995]; Shiang et al, 
Mutations in the transmembrane domain of FGFR3 cause the most common genetic form 
of dwarfism, achondroplasia, Cell 78:335-42 [1994]; Briggs et al, 
Pseudoachondroplasia and multiple epiphyseal dysplasia due to mutations in the 
cartilage of oligomeric matrix protein gene, Nature Genetics 10:330-36 [1995]; Superti- 
Ferga et al, Recessively inherited multiple epiphyseal dysplasia with normal stature, club 
foot and double layered patella caused by DTDST mutation, J. Med. Genet. 36:621-24 
[1999]). But most reported diagnostic methods relate to osteogenic proteins. (E.g., 
Khandke, K. M., et al, Chromatographic process for the copurification of chondroitinase 
I and II proteins from Proteus vulgaris, U.S. Patent No. 5,525,500; Parsons, T. F., et al, 
Osteogenic Factors, U.S. Patent No. 5,106,626; Oppermann, H., et al, Cartilage and 
bone-inducing proteins, U.S. Patent No. 5,750,651). 

Accordingly, a reliable method is still needed for diagnosing SEMD, 
detecting the presence of SEMD in recessive carriers, and for treating osteoarthritic 
disorders, including osteochondrodysplasias, that are caused or aggravated by deficient 

9 
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The present invention provides these and other advantages, as 



SUMMARY OF THE INVENTION 

The present invention relates to an isolated polynucleotide or to a nucleic 
acid construct that comprises a nucleic acid segment encoding a 3'-phosphoadenosine-5'- 
phosphosulfate (PAPS) synthetase (PAPSS), particularly, a human PAPSS2 nucleotide 
sequence of (SEQ. ID. NO.:l) or an orthologous murine Papss2 nucleotide sequence 
(SEQ. ID. NO.:2), sequences complementary to either one of them, degenerate coding 
sequences, or gene-specific fragments of (SEQ. ID. NOS.:l and 2). The present 
polynucleotides and nucleic acid constructs containing PAPSS2 and Papss2 nucleotide 
sequences include RNA; DNA; and chimeric RNA/DNA. Embodiments include probes, 
primers, and expression vectors containing PAPSS2- and Papss2-specific nucleotide 
sequences. 

The present invention also relates to a genetically modified vertebrate cell 
containing a nucleic acid construct of the present invention and to a non-human vertebrate 
containing the cell. The present invention also relates to human PAPSS2 and murine 
Papss2 proteins encoded by the present polynucleotide or nucleic acid construct, 
including fusion proteins that contain, together with a PAPSS2 or Papss2 amino acid 
sequence, any other predetermined polypeptide sequence. The present invention also 
relates to antibodies and antibody fragments that selectively bind the PAPSS2 or Papss2 
protein. 

The present invention is also directed to a method of diagnosing 
spondyloepimetaphyseal dysplasia (SEMD) in a human subject. The method involves 
amplifying nucleic acids from a sample that define an PAPSS2 gene sequence, or a gene- 
specific fragment thereof; and analyzing the amplified nucleic acids for the presence of 
homozygosity for a variant allele of PAPSS2. The sample is a bodily substance 
containing human nucleic acid, for example a blood sample, obtained from a human 
subject having at least one symptom of SEMD. Homozygosity for a variant allele of 
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PAPSS2 is diagnostic for SEMD in the human subject. In particular, the present 
invention also relates to a method of diagnosing SEMD Pakistani-type in a human 
subject. But the present invention also relates to a method of identifying a heterozygous 
human carrier of an SEMD-associated allele. 

The present invention provides a genetic testing kit for diagnosing SEMD 
or for identifying a human carrier of SEMD, including SEMD Pakistani-type. The 
genetic testing kit contains oligonucleotide primers of the present invention. 

Also, nucleic acid constructs of the present invention are used in a method 
of gene therapy for treating a human subject having an osteoarthritic disorder that is 
caused or aggravated by deficient enzymatic sulfation activity. The present invention is 
also related to a protein therapy method for treating a human subject having an 
osteoarthritic disorder that is caused or aggravated by deficient enaymatic sulfation 
activity. This method employs the inventive PAPSS2 fusion protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a genetic map of the interval containing the SEMD disease 
gene on human chromosome 10. Distances between loci are given in centimorgans (cM). 

Figure 2 shows a genetic map of the bm region of mouse chromosome 19. 
The 0.7 cM interval containing the disease gene and locus D19Mitl3 is shown with a 
darker line. 

Figure 3 shows a variant allele of PAPSS2 associated with SEMD 
Pakistani type. Sequences derived from amplified DNA fragment from an affected 
family member (SEMD) and from a control (NL) are shown. An arrow at nucleotide 
+1424 marks the location of a mutation, and the DNA sequence and the implied effect of 
the mutation on the PAPSS2 protein sequence is summarized below the nucleotide 
sequence. 
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Figure 2 shows a genetic map of the bm region of mouse chromosome 19. 
The 0.7 cM interval containing the disease gene and locus D19MU1S is shown with a 
darker line. 

Figure 3 shows a variant allele of PAPSS2 associated with SEMD 
Pakistani type. Sequences derived from amplified DNA fragment from an affected 
family member (SEMD) and from a control (NL) are shown. An arrow at nucleotide 
+1424 marks the location of a mutation, and the DNA sequence and the implied effect of 
the mutation on the PAPSS2 protein sequence is summarized below the nucleotide 
sequence. 

Figure 4 compares expression of PAPSS1 and PAPSS2 mRNAs with an 
actin mRNA control in samples from various human tissues, as visualized by Northern 
hybridization with specific cDNA probes for each. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention relates to an isolated polynucleotide, of natural 
origin or a nucleic acid construct, that comprises a nucleic acid segment encoding an 
ATP sulfurylase/APS kinase, i.e., a PAPS synthetase (PAPSS), particularly, a human 
PAPSS2 nucleotide sequence of (SEQ. ID. NO.:l) or an orthologous murine Papss2 
nucleotide sequence (SEQ. ID. NO.:2). 

The human PAPSS2 gene is located on chromosome 10 at locus 10q23-24. 
(ul Haque, M.F., et al [1998]). More specifically, the PAPSS2 gene region is flanked by 
microsatellite markers D10S1143 and D10S2470. (See Figure 1; C. Dib et aL, A 
comprehensive genetic map of the human genome based on 5,264 microsatellites, Nature 
380(6570): 152-54 [1996]). The cloned human PAPSS2 nucleotide sequence is the 
following (GenBank accession AF091242; transcriptional start site [underlined] 
commences at position +1): 

-80 CTGCTGCCGC CGCCGCCGCC GCCGTCCCTG CGTCCTTCGG TCTCTGCTCC CGGGACCCGG 
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-20 CTCCGCCGCA GCCAGCCAGC ATGTCGGGGA TCAAGAAGCA AAAGACGGAG AACCAGCAGA 

+4 1 AATCCACCAA TGTAGTCTAT CAGGCCCACC ATGTGAGCAG GAATAAGAGA GGGCAAGTGG 

+ 10 1 TTGGAACAAG GGGTGGGTTC CGAGGATGTA CCGTGTGGCT AACAGGTCTC TCTGGTGCTG 

+ 161 GAAAAACAAC GATAAGTTTT GCCCTGGAGG AGTACCTTGT CTCCCATGCC ATCCCTTGTT 

+22 1 ACTCCCTGGA TGGGGAC AAT GTCCGTCATG GCCTTAACAG AAATCTCGGA TTCTCTCCTG 

+28 1 GGGACAGAGA GGAAAATATC CGCCGGATTG CTGAGGTGGC TAAGCTGTTT GCTGATGCTG 

+34 1 GTCTGGTCTG CATTACCAGC TTTATTTCTC CATTCGCAAA GGATCGTGAG AATGCCCGC A 

+40 1 AAATACATGA ATCAGCAGGG CTGCCATTCT TTGAAATATT TGTAGATGCA CCTCTAAATA 

+46 1 TTTGTGAAAG CAGAGACGTA AAAGGCCTCT ATAAAAGGGC CAGAGCTGGG GAGATTAAAG 

+52 1 GATTTACAGG TATTGATTCT GATTATGAGA AACCTGAAAC TCCTGAGCGT GTGCTTAAAA 

+58 1 CCAATTTGTC CACAGTGAGT GACTGTGTCC ACCAGGTAGT GGAACTTCTG CAAGAGCAGA 

+64 1 ACATTGTACC CTATACTATA ATCAAAGATA TCCACGAACT CTTTGTGCCG GAAAACAAAC 

+70 1 TTGACCACGT CCGAGCTGAG GCTGAAACTC TCCCTTCATT ATCAATTACT AAGCTGGATC 

+76 1 TCCAGTGGGT CCAGGTTTTG AGCGAAGGCT GGGCCACTCC CCTCAAAGGT TTCATGCGGG 

+82 1 AGAAGGAGTA CTTACAGGTT ATGCACTTTG ACACCCTGCT AGATGATGGC GTGATCAACA 

+88 1 TGAGCATCCC CATTGTACTG CCCGTCTCTG CAGAGGATAA GACACGGCTG GAAGGGTGCA 

+94 1 GCAAGTTTGT CCTGGCAC AT GGTGGACGGA GGGTAGCTAT CTTACGAGAC GCTGAATTCT 

+1001 ATGAACAC AG AAAAGAGGAA CGCTGTTCCC GTGTTTGGGG GACAAC ATGT AC AAAACACC 

+1061 CCCATATC AA AATGGTGATG GAAAGTGGGG ACTGGCTGGT TGGTGGAGAC CTTCAGGTGC 

+1121 TGGAGAAAAT AAGATGGAAT GATGGGCTGG ACCAATACCG TCTGACACCT CTGGAGCTCA 

+1181 AACAGAAATG TAAAGAAATG AATGCTGATG CGGTGTTTGC ATTCCAGTTG CGCAATCCTG 

+ 1 24 1 TCCAC AATGG CCATGCCCTG TTGATGCAGG ACACCTGCCG CAGGCTCCTA GAGAGGGGCT 

+1301 ACAAGC ACCC GGTCCTCCTA CTACACCCTC TGGGCGGCTG GACC AAGGAT GACGATGTGC 

+1361 CTCTAGACTG GCGGATGAAG CAGC ACGCGG CTGTGCTCGA GGAAGGGGTC CTGGATCCCA 

+142 1 AGTCAACCAT TGTTGCCATC TTTCCGTCTC CCATGTTATA TGCTGGCCCC ACAGAGGTCC 

+1481 AGTGGCACTG CAGGTCCCGG ATGATTGCGG GTGCCAATTT CTACATTGTG GGGAGGGACC 

+1541 CTGC AGGAAT GCCCC ATCCT GAAACCAAGA AGGATCTGTA TGAACCC ACT CATGGGGGC A 

+1601 AGGTCTTGAG CATGGCCCCT GGCCTCACCT CTGTGGAAAT CATTCCATTC CGAGTGGCTG 

+1661 CCTAC AACAA AGCCAAAAAA GCC ATGGACT TCTATGATCC AGC AAGGCAC AATGAGTTTG 

+ 1 72 1 ACTTC ATCTC AGGAACTCGA ATGAGGAAGC TCGCCCGGGA AGGAGAGAAT CCCCCAGATG 

+1781 GCTTCATGGC CCCC AAAGC A TGGAAGGTCC TGAC AGATTA TTACAGGTCC CTGGAGAAGA 

+1841 ACTAAGCCTT TGGGTCCAGA GTTTCTTTCT GAAGTGCTCT TTGATTACCT TTTCTATTTT 

+1 90 1 TATGATTAGA TGCTTTGTAT TAAATTGCTT CTCA// (SEQ. ID. NO. : 1). 

Preferably, the nucleic acid segment encoding a human PAPS synthetase, 
specifically PAPSS2, comprises a nucleotide sequence defining an open reading frame 
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within SEQ. ID. NO.:l that extends from nucleotide position +1 through +1845 (SEQ. 
ID. NO.:9). 



The orthologous murine Papss2 gene is located in the bm region of mouse 
chromosome 19, within a 0.7 cM interval near marker D19MU13. (See, Figure 2; ul 
5 Haque, M.F., et al [1998]). The cloned murine Papss2 nucleotide sequence is the 
following (GenBank accession AF085144; transcriptional start site [underlined] 
commences at position +1): 



CO 



: 3 j 3 

■ih 



-60 GTATTCTCAA CATCAGATAT CATGTCTTGG AGGAAGTTAC CTAAACTCTG AAGAATTATC 

+ 1 ATGTCTGCAA ATTTCAAAAT GAACC ATAAA AGAGACC AGC AAAAATCC AC CAATGTGGTC 

10 +61 TACCAGGCCC ATCATGTGAG CAGGAACAAG AGAGGACAAG TGGTTGGAAC CAGGGGAGGA 

+ 121 TTCCGAGGAT GTACCGTGTG GCTAACAGGT CTCTCTGGTG CTGGGAAAAC AACC ATAAGC 

+181 TTTGCTTTGG AAGAGTACCT TGTATCTC AC GCCATCCC AT GTTACTCCCT GGATGGGGAC 

+24 1 AATGTCCGTC ATGGCCTTAA TAAGAACCTG GGATTCTCTG CCGGGGACCG AGAAGAGAAT 

1 +301 ATCCGCCGGA TCGCGGAGGT GGCCAAGCTC TTTGCCGACG CCGGCCTGGT TTGCATCACC 

^ 15 +361 AGCTTTATCT CTCCTTTTGC AAAGGATCGT GAGAATGCCC GAAAAATCCA CGAATCAGCA 

'4 

ls 1 +42 1 GGACTCCCGT TCTTTGAGAT CTTTGTAGAT GCGCCTTTAA ATATCTGTGA AAGCCGAGAC 

+48 1 GTAAAAGGAC TCTACAAACG AGCCCGAGCA GGAGAGATTA AAGGGTTTAC AGGCATCGAT 

,aP ! 

2 +54 1 TCTGACTATG AGAAACCTGA AACTCCAGAG TGTGTGCTGA AGACCAACTT GTCTTCAGTA 
+60 1 AGCGACTGTG TGCAAC AGGT GGTGGAACTT TTGCAGGAGC AGAAC ATTGT ACCCCACACC 

20 +66 1 ACC ATCAAAG GC ATCCACGA ACTCTTTGTG CCAGAAAACA AAGTCGATCA AATCCGAGCT 

+72 1 GAGGC AGAGA CTCTCCCATC ACTACCAATT ACCAAGCTGG ATCTGCAGTG GGTGCAGATT 

+78 1 CTGAGTGAAG GCTGGGCCAC TCCCCTCAAA GGCTTTATGC GGGAGAAGGA ATACTTGCAA 

+84 1 ACTCTACACT TCGACACTCT ACTGGACGAT GGAGTCATCA ACATGAGTAT TCCC ATTGTA 

+90 1 TTGCCCGTTT CTGCGGATGA CAAGGCACGG CTCGAAGGGT GCAGCAAATT TGCCTTGATG 

25 +96 1 TACGAAGGTC GGAGGGTCGC TCTATTACAG GACCCTGAAT TCTATGAGCA TAGGAAAGAG 

+1021 GAGCGTTGTT CTCGTGTGTG GGGAACAGCC ACTGCAAAGC ACCCCCATAT CAAAATGGTG 

+ 1081 ATGGAAAGTG GGGACTGGCT TGTTGGTGGA GACCTACAGG TGCTAGAGAG AATAAGGTGG 

+1141 GACGATGGGC TGGACCAATA CCGCCTTACG CCTCTGGAAC TCAAACAGAA GTGTAAAGAC 

+ 1201 ATGAATGCTG ATGCCGTGTT TGCATTCC AG TTGCGCAATC CTGTCCAC AA TGGTCATGCC 

30 + 126 1 CTCCTGATGC AGGACACCCG CCGCAGGCTC CTGGAGAGGG GTTACAAGCA CCCAGTCCTC 

+ 1321 CTGCTCCACC CTCTTGGGGG CTGGACCAAG GACGATGACG TACCTCTGGA ATGGAGGATG 

+ 1381 AAAC AGC ATG C AGCTGTACT GGAGGAAAGG GTCCTGGATC CCAAGTCAAC TATTGTTGCC 
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+ 1 44 1 ATCTTTCC AT CTCCTATGTT ATACGCTGGT CCCACAGAGG TCCAGTGGCA TTGCAGATGC 
+1501 CGGATGATTG C AGGAGCCAA TTTCTACATT GTGGGTAGGG ATCCCGC AGG AATGCCCCAT 
+ 1561 CCTGAGAC AA AGAAAGACCT ATATGAACCC ACCCACGGGG GCAAGGTCTT GAGTATGGCC 
+ 1 62 1 CCTGGCCTTA CCTCTGTGGA AATAATTCCG TTCCGAGTGG CTGCCTACAA TAAAATTAAA 
5 +1681 AAGGCCATGG ACTTTTATGA TCCAGCAAGG CACGAGGAGT TTGACTTCAT CTCAGGAACT 
+ 1 74 1 CGCATGAGGA AGCTCGCCCG GGAAGGAGAA GATCCCCCAG ATGGCTTCAT GGCCCCGAAA 
+ 1801 GCGTGGAAAG TGTTGACAGA TTACTACAGG TCTCTGGAGA AGACCAACTA GGTGCTCCTG 
+ 1861 GCTCTGGCTT CTTCCTCAAG TGCTCTCTGA CGATTTTTTT TTTCTATTTT TGTGATTTAG 
+1921 CTGCTCTGTA TCCAATTGCA //(SEQ. ID. NO.:2). 

10 Preferably, the nucleic acid segment encoding a murine PAPS synthetase, 

specifically Papss2, comprises a nucleotide sequence defining an open reading frame 
within SEQ. ID. NO.:2 that extends from nucleotide position +1 through +1851 (SEQ. 
ID. NO : 10). 



In other embodiments, the nucleic acid segment has a nucleotide sequence 
15 complementary to SEQ. ID. NOS.:l, 2, 9, or 10, or comprises a degenerate coding 
sequence of any of these. Alternatively, the present nucleic acid segment is a gene- 
Q specific fragment of SEQ. ID. NOS.:l, 2, 9 or 10, or a gene-specific fragment of a 

complementary sequence or degenerate coding sequence. Gene-specific fragments are 
nucleic acid segments having a contiguous nucleotide sequence that is specific to PAPSS2 
20 or Papss2, respectively. For example, a gene-specific fragment of SEQ. ID. NO.:l is 
SEQ. ID. NO.:9, and a gene-specific fragment of SEQ. ID. NO.:2 is SEQ. ID. NO : 10. 
The skilled artisan can readily determine other gene-specific fragments by conducting a 
sequence similarity search of a genomics data base, such as the GenBank database of the 
National Center for Biotechnology Information (NCBI), using a computerized algorithm, 
25 such as PowerBLAST, QBLAST, PSI-BLAST, PHI-BLAST, gapped or ungapped 
BLAST, or the "Align" program through the Baylor College of Medicine server. (E.g., 
Altchul, S.F., et al, Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs, Nucleic Acids Res. 25(17):3389-402 [1997]; Zhang, I, & 
Madden, T.L., PowerBLAST: a new network BLAST application for interactive or 
30 automated sequence analysis and annotation, Genome Res. 7(6):649-56 [1997]; Madden, 
T.L., et aL, Applications of network BLAST server, Methods Enzymol. 266:131-41 
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[1996]; Altschul, S.F., etal, Basic local alignment search tool, J. MoL Biol. 215(3):403- 
10 [1990]). Preferably, a gene specific fragment is at least 5 to 30 contiguous 
nucleotides long, more preferably at least 6 to 15 contiguous nucleotides long, and most 
preferably at least 7 to 10 contiguous nucleotides long. Embodiments of the isolated 
polynucleotide or nucleic acid construct of the present invention include nucleic acid 
segments comprising RNA, DNA, or chimeric RNA/DNA. 

RNA segments include messenger RNA encoding an PAPSS2 or Papss2 
protein, or a degenerate coding sequence, or a RNA sequence complementary to any of 
these. A complementary RNA segment can be an antisense RNA or a catalytic RNA 
sequence, such as a "hairpin" or "hammerhead" ribozyme, with a recognition sequence 
complementary to a gene-specific fragment of SEQ. ID. NOS.:l, 2, 9 or 10, or a 
degenerate coding sequence of any of these. 

Other embodiments of the isolated polynucleotide or nucleic acid construct 
include nucleic acid segments of DNA. In some embodiments, the DNA segment 
encoding a PAPSS2 or Papss2 protein is operatively linked to a promoter sequence. The 
promoter sequence can be an autologous PAPSS2 or Papss2 promoter sequence. 
Alternatively, in a nucleic acid construct of the present invention, any suitable promoter 
sequence can be used, including an autologous promoter, a promoter from a heterologous 
human or non-human PAPS synthetase gene (e.g., PAP SSI or Papssl promoters), or a 
predetermined promoter sequence from any other human or non-human gene. Useful 
promoter sequences include constitutive promoters, such as, but not limited to, 
cytomegalovirus (CMV) promoter, or inducible promoters, such as, but not limited to, the 
human C-reactive protein (CRP) promoter (e.g., Kanzler, S., et al 7 TGF-betal in liver 
fibrosis: an inducible transgenic mouse model to study liver fibrogenesis, Am. J. Physiol. 
276(4Pt l):G1059-68 [1999]), or the insulin-like growth factor (IGF-I) promoter (e.g., 
Meton I, et al, Growth hormone induces insulin-like growth factor-I gene transcription 
by synergistic action of STAT5 and HNF-lalpha, FEBS Lett. 444(2-3): 155-59 [1999]). 
Useful promoters include those that promote transcription in cells of diverse tissues, such 
as, but not limited to, an insulin receptor (IR) gene promoter (e.g., Tewari, D.S., et al^ 
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Characterization of the promoter region and 3' end of the human insulin receptor gene, J. 
Biol. Chem. 264(27): 16238-45 [1989]); growth hormone receptor (GHR) P2 or P3 
promoters (e.g., Jiang, H., et aL, Isolation and characterization of a novel promoter for 
the bovine growth hormone receptor gene, J. Biol. Chem. 274(12):7893-900 [1999]); or a 
leptin promoter (e.g., Chen, X.L., et aL, Analysis of a 762-bp proximal leptin promoter to 
drive and control regulation of transgene expression of growth hormone receptor in mice, 
Biochem. Biophys. Res. Commun. 262(1): 187-92 [1999]). 

Also useful for various applications are tissue-selective promoters, i.e., 
promoters from which expression occurs preferentially in cells of a particular kind of 
tissue, compared to one or more other types of tissue. These include, cartilage-selective 
promoters for expression in chondrocytes, for example, an osteocalcin (OC) promoter 
(e.g., Newberry, E.P., et aL, The RRM domain of MINT, a novel Msx2 binding protein, 
recognizes and regulates the rat osteocalcin promoter, Biochemistry 38(33): 10678-90 
[1999]); a SOX9 promoter, aggrecan gene promoter (AGC1), or collagen oligomeric 
matrix protein (COMP) gene promoter (e.g., Kanai, Y. & Koopman, P., Structural and 
functional characterization of the mouse Sox9 promoter: implications for campomelic 
dysplasia, Hum. Mol. Genet. 8(4):691-96 [1999]; Newton et al, Characterization of 
human and mouse cartilage oligomeric matrix protein, Genomics 24:435-39 [1994); or a 
promoter from a collagen gene, such as, but not limited to promoters for COL2A1, 
COL9A1, or COL10A1. (e.g., Ganguly, A, et aL, Targeted insertions of two exogenous 
collagen genes into both alleles of their endogenous loci in cultured human cells: the 
insertions are directed by relatively short fragments containing the promoters and the 5' 
ends of the genes, Proc Natl Acad Sci USA 91(15):7365-9 [1994]; Dharmavaram, R. M, 
et aL, Detection and characterization of Spl binding activity in human chondrocytes and 
its alterations during chondrocyte dedifferentiation, J. Biol Chem 272(43):26918-25 
[1997]; Zhou, G, et aL, Three high mobility group-like sequences within a 48-base pair 
enhancer of the CoUal gene are required for cartilage-specific expression in vivo, J. Biol 
Chem 273(24): 14989-97 [1998]; Seghatoleslami, M. R., etaL, Differential regulation of 
COL2A1 expression in developing and mature chondrocytes, Matrix Biol 14(9): 753 -64 
[1995]; Lefebvre, V., et al., An 18-base-pair sequence in the mouse proalphal(II) 
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collagen gene is sufficient for expression in cartilage and binds nuclear proteins that are 
selectively expressed in chondrocytes, Mol Cell Biol 1 6(8) :45 12-23 [1996]; Zhou, G., et 
al, A 182 bp fragment of the mouse pro alpha 1(11) collagen gene is sufficient to direct 
chondrocyte expression in transgenic mice, J. Cell Sci, 108(Pt 12): 3 677-84 [3677-84]; 
Mukhopadhyay, K., et al y Use of a new rat chondrosarcoma cell line to delineate a 119- 
base pair chondrocyte-specific enhancer element and to define active promoter segments 
in the mouse pro-alpha 1(11) collagen gene, J. Biol Chem 270(46) :277 11-9 [1995]; 
Vikkula, M., et al, Structural analysis of the regulatory elements of the tyupe-II 
procollagen gene. Conservation of promoter and first intron sequences between human 
and mouse, Biochem J 285(Pt l):287-94 [1992]; Beier, F., et al, Localization of silencer 
and enhancer elements in the human type X collagen gene, J Cell Biochem 662(2) :2 10-8 
[1997]; Thomas, J. T. s Sequence comparison of three mammalian type-X collagen 
promoters and preliminary functional analysis of the human promoter, Gene 160(2):291- 
6 [1995]; Apte, S. S., Characterization of the mouse type X collagen gene, Matrix 
13(2): 165-79 [1993]). A cartilage-derived retinoic acid-sensitive protein (CD-RAP) 
gene promoter is also useful for cartilage-selective expression by chondrocytes, (e.g., 
Xie, W.F., et al, Transactivation of the mouse cartilage derived retinoic acid-sensitive 
protein gene by Sox9, J. Bone Miner. Res. 14(5):757-63 [1999]). 

For liver-selective expression in hepatocytes, useful promoter sequences 
include, an albumin gene promoter (e.g., Pastore, L., et al, Use of a liver-specific 
promoter reduces immune response to the transgene in adenoviral vectors, Hum. Gen. 
Ther. 10(11): 1773-81 [1999]); a CYP7A or CYP7A1 promoter (e.g., Nitta, M, et al, 
CPF: an orphan nuclear receptor that regulates liver-specific expression of the human 
cholesterol 7alpha-hydroxylase gene, Proc. Natl. Acad. Sci. USA 96(12):6660-65 [1999]; 
Chen, J., et al, Hepatocyte nuclear factor 1 binds to and transactivates the human but not 
the rat CYP7A1 promoter, Biochem. Biophys. Res. Commun. 260(3):829-34 [1999] ); a 
GHR PI promoter (e.g., Zou, L. ? et al, Isolation of a liver-specific promoter for human 
growth hormone receptor gene, Endocrinology 138(4): 1771-74 [1997]; Jiang, H., et al 
[1999]; Adams, T.E., Differential expression of growth hormone receptor messenger RNA 
from a second promoter, Mol. Cell Endocrinol. 108(l-2):23-33 [1995]); or a thrombin- 
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activatable fibrinolysis inhibitor (TAFI) promoter (e.g., Boffa, MB., et al, 
Characterization of the gene encoding human TAFI [thrombin-activatable fibrinolysis 
inhibitor; plasma procarboxypeptidase B], Biochemistry 38(20):6547-58 [1999]). 

For hematopoietic tissue-selective expression in hematopoietic precursor 
cells, useful promoters include a cyclin Al promoter (e.g., Muller, C, et al, Cloning of 
the cyclin Al genomic structure and characterization of the promoter region, J. Biol. 
Chem. 276(16): 11220-28 [1999]); a CD34 promoter (e.g., Burn, T.C., et al, 
Hematopoietic stem cell specific gene expression, U.S. Patent No. 5,556,954); or an 
integrin alphallb promoter (e.g., Wilcox, D. A., et al, Integrin alphallb promoter-targeted 
expression of gene products in megakaryocytes derived from retrovirus-transduced 
human hematopoietic cells, Proc. Natl. Acad. ScL USA 96(17):9654-59 [1999]). 

The foregoing examples of useful promoter sequences are not an 
exhaustive list, but are merely illustrative of the promoters available to the skilled artisan 
in practicing the present invention. 

For purposes of the present invention, "operatively linked" means that the 
promoter sequence, is located directly upstream from the PAPSS2 or Papss2 coding 
sequence, which is in reading frame, and that both sequences are oriented in a 5' to 3* 
manner, forming a transcriptional unit, such that transcription can take place in vitro in 
the presence of all essential enzymes, transcription factors, co-factors, activators, and 
reactants, under favorable physical conditions, e.g., suitable pH and temperature. This 
does not mean that, in any particular cell, conditions will favor transcription. 

By way of example, transcription of the DNA sequence encoding a 

PAPSS2 protein produces mRNA transcript, which is translatable into PAPSS2 protein, 

or a functional fragment thereof. Thus, one benefit of the present invention is that the 

nucleic acid can be used in a genetic replacement therapy for humans to correct clinical 

disorders derived from defective endogenous expression of PAPSS activity. Such 

defective expression can result from, but need not result from, endogenous 

underexpression of functional PAPSS2 protein, in particular. 
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Other preferred embodiments of the present nucleic acid construct also 
include, operatively linked as part of the transcriptional unit, a reporter gene sequence 
encoding a reporter protein for facilitating the detection or selection of cells containing 
the present nucleic acid construct and expressing PAPSS2 or Papss2. Preferably, but not 
necessarily, the reporter gene encodes a fluorescent protein. Fluorescent proteins include 
green fluorescent protein (or enhanced green fluorescent protein), yellow fluorescent 
protein, blue fluorescent protein, a phycobiliprotein, such as phycoerythrin or 
phycocyanin, or any other protein which fluoresces under suitable wave-lengths of light. 
Another reporter gene suitable for some applications is a gene encoding a protein that can 
enzymatically lead to the emission of light from a substrate(s); for purposes of the present 
invention, such a protein is a "light-emitting protein." For example, a light-emitting 
protein includes proteins such as luciferase or apoaequorin. 

Alternatively, the DNA segment encodes a PAPSS2 or Papss2 gene- 
specific antisense RNA, such as an antisense RNA that specifically hybridizes to PAPSS2 
mRNA or Papss2 mRNA, preventing translation therefrom. 

In another embodiment, the nucleic acid construct comprises a chimeric 
RNA/DNA, such as a chimeric RNA/DNA oligonucleotide. Such RNA/DNA chimers 
are useful, for example, in altering single nucleotide polymorphisms (SNPs) or other 
point mutations in an endogenous PAPSS2 gene sequence. For example, a SNP in 
PAPSS2 associated with encoding a truncated or otherwise dysfunctional PAPSS2 protein 
can be stably repaired using a chimeric RNA/DNA oligonucleotide of the present 
invention. 

In other embodiments, the nucleic acid construct is a probe or primer 
sequence. The inventive probe is a labeled polynucleotide that is useful in hybridization 
assays, such as Southern, Northern, or fluorescence in situ (FISH) hybidizations, for 
detecting the presence in a sample of polynucleotides having a gene-specific PAPSS2 or 
Papss2 sequence. The probe is fully complementary to SEQ. ID. NOS.:l, 2, 9, or 10, or 
is complementary to a gene-specific fragment of any of these. The suitable labels for a 
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probe are typically radioisotopes or fluorogens incorporated into or covalently linked to 
the polynucleotide by methods known in the art, and are detectable by conventional 
radiation or fluorescence detecting methods. 

The primer of the present invention is an oligonucleotide sequence that is 
useful in amplifying a nucleic acid segment from a sample that defines a nucleotide 
sequence specific to PAPSS2 or Papss2 by known methods of PCR, including RT-PCR. 
The primer is preferably 15 to 30 nucleotides long, and most preferably 17 to 22 
nucleotides long, but primers as short as 7 contiguous nucleotides may be useful for some 
gene-specific sequences. (E.g., Vincent, J., et al y Oligonucleonucleotides as short as 7- 
mers can be used for PCR amplification, DNA Cell Biol. 13(l):75-82 [1994]). 

Examples of useful oligonucleotide primer sequences for amplifying 
PAPSS2-specific nucleic acid segments include primer sequences comprising: 

5-TGGACCAAGGATGACGATGT-3' (SEQ. ID. NO.: 3; forward primer); 
5'-CGGAAAGATGGCAACAATGG (SEQ. ED. NO.: 4; reverse primer); 
5'-CTGGTGCTGGAAAAACAACG-3' (SEQ. ID. NO.: 5; forward primer); 
5-TGCGAATGGAGAAATAAAGCTG (SEQ. ID. NO.: 6; reverse primer); 
5 ' -GCC AGCC AGC ATGTCGGGGAT-3 ' (SEQ . ID. NO.:ll; forward primer); 
5 '-ACCTGAAACTCCTGAGCGTGTGCT-3 ' (SEQ. ID. NO.: 12; forward primer); 
5 '-GATGTGCCTCTAGACTGGCGG-3 ' (SEQ. ID. NO.:13; forward primer); 
5'-GAGCACTTCAGAAAGAAACTCTGG-3'(SEQ. ID. NO : 14; reverse primer); 
5'-CATCCGCCAGTCTAGAGGCAC-3' (SEQ. ID. NO.:15; reverse primer); 
5 9 - AGGTGTC AGACGGT ATTGGTC-3 5 (SEQ. ID. NO : 16; reverse primer); 
5'-GTCACTCACTGTGGACAAATTGG-3'(SEQ. ID. NO : 17; reverse primer); 
5'-CACCTCAGCAATCCGGCGGAT-3' (SEQ. ID. NO : 18; reverse primer);or 
5*-GCATGTCCAGACAGACACCAC-3' (SEQ. ID. NO.:28; reverse primer), but a 

complementary nucleotide sequence, or a gene-specific fragment of any of these is also 
useful; also useful are P^P&S^-specific primer sequences that overlap any of the afore- 
mentioned primer sequences at 5 or more contiguous nucleotide positions at their 5* or 3' 
end. 
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Useful primers for amplifying Papss2-$pecific sequences include primer 
sequences comprising: 

5 '-TCTGGCAC AAAGAGTTCGTG-3 ' [SEQ. ID. NO.: 19; reverse primer]); 
5 '-GCCAGTTTGTAAC CGAGT ATTC-3 ' (SEQ. ID. NO.:20; forward primer); 
5 '-GCAATTGGATACAGAGCAGC TA-3' (SEQ. ID. NO.:21; reverse primer); 
5'-GACAATGTCCGTCATGGCCTTA-3' (SEQ. ID. NO.:22; forward primer); 
5'-ATTCCCATTGTATTGCCCGTT-3' (SEQ. ID. NO.:23; forward primer); 
5 '-AACGGGCAATACAATGGGAAT-3 5 (SEQ. ID. NO :24; reverse primer); 
5 '-GATAAAGCTGGTGATGCAAACC-3 ' (SEQ. ID. NO.:25; reverse primer);5'- 
CATGG GATGGCGTGAGAT AC-3 ' (SEQ. ID. NO.:26; reverse primer); or 
5 ' -C AT A AGCTTTGCTTT GGAAGAGT-3' (SEQ. ID. NO.:27; forward primer); but a 
complementary nucleotide sequence, or a gene-specific fragment of any of these is also 
useful; also useful are Fapss2-specific primer sequences that overlap any of the afore- 
mentioned primer sequences at 5 or more contiguous nucleotide positions at their 5' or 3 1 
end. 

The present invention also relates to a pair of oligonucleotide primers (i.e., 
a primer set) comprising at least one forward and at least one reverse oligonucleotide 
primer, which together are capable of producing detectable nucleic acid amplification 
products from the 5* and 3* ends of a PAPSS2 or Papss2 nucleic acid template in a PCR- 
or RT-PCR-based nucleic acid amplification method. 

In some embodiments, the amplification products that the pair of primers 
can amplify include nucleic acid amplification products comprising (SEQ. ID. NO.:l) or 
(SEQ. ID. NO.: 9); a nucleotide sequence complementary to either of those; or a PAPSS2 
gene-specific fragment of any of these. Examples of this embodiment of the inventive 
pair of oligonucleotide primers include pairs containing a forward primer having a 
nucleotide sequence comprising (SEQ. ID. NOS.: 3, 5, or any of 1 1-13), a complementary 
nucleotide sequence, a gene-specific fragment of either of these, or a nucleotide sequence 
overlapping at 5 or more contiguous nucleotide positions any sequence of those afore- 
mentioned at its 5* or 3' end; and containing a reverse primer having a nucleotide 
sequence comprising (SEQ. ID. NO.: 4, 6, 14-18, or 28), a complementary nucleotide 
sequence, a gene-specific fragment of either of these; or a nucleotide sequence 
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overlapping at 5 or more contiguous nucleotide positions any sequence of those afore- 
mentioned at its 5 ! or 3 1 end. For example, useful oligonucleotide primer pairs for 
amplifying a PAPSS2 gene sequence include (SEQ. ID. NO.:3 and 4) or (SEQ. ID. NO.:5 
and 6). 

In other embodiments, the amplification products that the pair of 
oligonucleotide primers can amplify include nucleic acid amplification products having 
(SEQ. ID. NO.:2) or (SEQ. ID. NO.: 10); a nucleotide sequence complementary to either 
of those; or a Papss2 gene-specific fragment of any of these. Examples of this 
embodiment of the inventive primer pair include pairs containing a forward primer having 
a nucleotide sequence comprising (SEQ. ID. NO.:20, 22, 23, or 27), a complementary 
nucleotide sequence, a gene-specific fragment of either of these, or a nucleotide sequence 
overlapping at 5 or more contiguous nucleotide positions any sequence of those afore- 
mentioned at its 5' or 3' end; and containing a reverse primer having a nucleotide 
sequence comprising (SEQ. ID. NOS.: 19, 21, or any of 24-26), a complementary 
nucleotide sequence, a gene-specific fragment of either of these; or a nucleotide sequence 
overlapping at 5 or more contiguous nucleotide positions any sequence of those afore- 
mentioned at its 5' or 3' end. 

Other embodiments of the inventive nucleic acid construct include those in 
which, cloned into an expression vector, is a DNA segment encoding a PAPSS2 or 
Papss2 protein, operatively linked with any suitable promoter sequence, including a 
human PAPSS2 promoter, a promoter from another human or non-human PAPS 
synthetase gene (e.g., PAPSS1, Papssl, or Papssl promoters), or a predetermined 
promoter sequence from any other human or non-human gene, as described above. 
Suitable vector systems include, but are not limited to, viral vectors, for example, 
adenoviruses, adeno-associated viruses, retroviruses such as human immune-deficiency 
virus, other lentiviruses, such as Moloney murine leukemia virus and the retrovirus vector 
derived from Moloney virus called vesicular-stomatitis-virus-glycoprotein (VSV-G)- 
Moloney murine leukemia virus, mumps virus, and transfecting fragments of any of these 
viruses, and other viral DNA segments that facilitate the uptake of the desired DNA 
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segment by, and release into, the cytoplasm of cells and mixtures thereof. All of the 
above viruses may require modification to render them non-pathogenic or less antigenic. 
Other known vector systems, however, are also suitable for containing the DNA segment. 

The present invention also relates to a genetically modified vertebrate cell 
containing the nucleic acid construct of the present invention, regardless of the method by 
which the construct was introduced into the cell. Embodiments include cartilage-forming 
cells, such as chondrocytes, and liver cells, such as hepatocytes, as well as cells derived 
from other tissues such as, colon, lung, placenta, aortic or other vascular endothelium, 
bone marrow, or heart. 

If the nucleic acid construct includes a fluorescent reporter gene, as 
described above, the genetically modified cells of the present invention can be detected, 
isolated or selected from unmodified cells with the aid of, for example, a flow-activated 
cell sorter (FACS), set at the appropriate wavelength(s), or using conventional 
microscopic technology. 

In particular applications involving a genetically modified cell that 
expresses additional xenogeneic genes from any promoter, this expression may be linked 
to a reporter gene that encodes a different fluorescent or light-emitting protein from the 
reporter gene linked to PAPSS2 or Papss2 expression. Thus, multiple reporters 
fluorescing or emitting at different wavelengths can be chosen and cell selections based 
on the expression of multiple traits can be made. 

Gene delivery to the cell is by any suitable method including in vivo and 

vitro gene delivery methods. (E.g., D.T. Curiel et al, U.S. Patent Nos. 5,521,291 and 

5,547,932). Typically, gene delivery involves exposing a cell to a gene delivery mixture 

that includes preselected genetic material together with an appropriate vector, mixed, for 

25 example, with an effective amount of lipid transfecting agent (lipofection). The amount 

of each component of the mixture is chosen so that gene delivery to a specific species of 

cell is optimized. Such optimization requires no more than routine experimentation. The 

ratio of DNA to lipid is broad, preferably about 1:1, although other proportions may also 
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be utilized depending on the type of lipid agent and the DNA utilized. This proportion is 
not crucial Other well known gene delivery methods include electroporation or chemical 
methods. (E.g., M. Ostresh, No barriers to entry: transfection tools get biomolecules in 
the door, The Scientist 13(ll):21-23 [1999]). 

"Gene delivery agent", as used herein, means a composition of matter 
added to the genetic material for enhancing the uptake of exogenous DNA segment(s) 
into a vertebrate cell The enhancement is measured relative to the uptake in the absence 
of the gene delivery agent. Examples of gene delivery agents include 
adenovirus-transferrin-polylysine-DNA complexes. These complexes generally augment 
the uptake of DNA into the cell and reduce its breakdown during its passage through the 
cytoplasm to the nucleus of the cell. 

Other preferred gene delivery agents include Lipofectin®, DMRIE C, 
Cellfectin® or Lipofectamine (Life Technologies), LipoTAXI (Stratagene), Superfect or 
Effectene (Qiagen). Although these are not as efficient gene delivery agents as viral 
agents, they have the advantage that they facilitate stable integration of xenogeneic DNA 
sequence into the vertebrate genome, without size restrictions commonly associated with 
virus-derived gene delivery agents. A virus, or transfecting fragment thereof, can be 
used to facilitate the delivery of the genetic material into the cell. Examples of suitable 
viruses include adenoviruses, adeno-associated viruses, retroviruses such as human 
immune-deficiency virus, other lentiviruses, such as Moloney murine leukemia virus and 
the retrovirus vector derived from Moloney virus called vesicular-stomatitis-virus- 
glycoprotein (VSV-G)-Moloney murine leukemia virus, mumps virus, and transfecting 
fragments of any of these viruses, and other viral DNA segments that facilitate the uptake 
of the desired DNA segment by, and release into, the cytoplasm of cells and mixtures 
thereof. All of the above viruses may require modification to render them non-pathogenic 
or less antigenic. Other known vector systems are also useful. 

The present invention also relates to genetically modified non-human 
vertebrates comprising the present genetically modified cells, for example, non-human 
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primates, mice, rats, rabbits, gerbils, hamsters, canines, felines or other non-human 
mammals. Other vertebrates include birds such as chickens, turkeys, ducks, ostriches, 
emus, geese, guinea fowl, doves, quail, rare and ornamental birds, and the like. Broadly 
speaking, a genetically modified vertebrate is one that has had exogenous or foreign DNA 
permanently introduced into one or more of its cells, whether somatic or germ cells. 

The present invention also relates to an isolated human PAPS synthetase 
protein comprising a polypeptide having an amino acid sequence of (SEQ. ID. NO.: 7), 
i.e., the following contiguous amino acid residues, or an antibody binding fragment of the 
protein at least 6 amino acids long (amino acid residue 475 is in boldface): 

MSGIKKQKTENQQKSTNWYQAHHVSRNKRGQVVGTRGGFRGCT 

VWLTGLSGAGKTTISFALEEYLVSHAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRR 

IAEVAKLFADAGLVCITSFISPFAKDR 

KGLYKRARAGEIKGFTGIDSDYEKPETPE 

TIIKDIHELFWENKLDHVRAEAE 

YLQVMHFDTLLDDGVINMSIPIVLPVSAEDKTRLEGCSKFVLAHGG 

EHRKEERCSRVWGTTCTKHPHIKMVMESGDWL 

LKQKCKEMNADAWAFQLRNPVH^ 

D 

DDWLDWRMKQHAAVLEEGVLDPKSTIVAIFPSPMLYAGPTEVQWHCRSRMIAGANFY 
IVGRDPAGMPHPETKKDLYEPTHGGKVLSMAPGLTSVEIIPFRVAAYN 
PARHNEFDFISGTRMRKLAREGENPPDGFMAPKAWKVLTDYYRSEM ID. 
NO.:7) 

This polypeptide sequence is encoded for example by (SEQ. ID. NO.:9), 
but is also encoded by other degenerate coding sequences. 

In one embodiment, the PAPSS2 protein forms part of a PAPSS2 fusion 
protein along with a second predetermined polypeptide segment, which can have an 
amino acid sequence related or unrelated to (SEQ. ID. NO.:7). The PAPSS2 fusion 
protein includes a first PAPSS2 polypeptide segment comprising an amino acid sequence 
of (SEQ. ID. NO.:7), an enzymatically active fragment thereof, or a gene-specific 
antibody binding fragment of (SEQ. ID. NO.: 7) at least 6 amino acids long; and a second 

26 

LAI 351813vl 



Patent 18810-81552 

predetermined polypeptide segment. The fusion protein is produced by known 
recombinant methods, and the second predetermined polypeptide sequence can be any 
desired polypeptide, for example, a known antigenic sequence, such as a specific goat, 
rabbit, or HLA antigen to which immunoreagents of commercially available 
immunoassay systems are directed. Thus, in some applications, conventional 
immunoassay systems can be used "off the shelf to detect and quantify a PAPSS2 fusion 
protein by detecting the second polypeptide sequence. 

In other applications, for example in protein therapy for the treatment of 
osteoarthritic disorders related to deficient enzymatic sulfation, the second polypeptide of 
the fusion protein is a polypeptide that allows the hybrid molecule to enter cells of a 
target tissue, for example, cartilage, liver, or platelets, whether in vitro or in vivo. For 
example, a second polypeptide can be chosen that infiltrates the cells of tissues, such as 
the human immunodeficiency vims (HIV) TAT protein (Schwarze, S.R., st cd., In vivo 
protein transduction: delivery of a biologically active protein into the mouse, Science 
285:1569-72 [1999]), and once within the cells, the PAPSS2 polypeptide segment of the 
fUsion protein becomes enzymatically active in the sulfation pathway within the cells. 

The present invention also relates to an isolated murine Papss2 protein 
comprising an amino acid sequence of (SEQ. ED. NO.: 8), i.e., the following contiguous 
amino acid residues, or an antibody binding fragment of the protein at least 6 amino acids 
long: 

MSANFKM>MKRDQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGC 

TVWLTGLSGAGKTTISFALEEYLVSHAIPCYSLDGDNVRHGLNBCNLGFSAGDREENIR 

RIAEVAKLFADAGLVCITSFISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRD 

VKGLYKRARAGEIKGFTGIDSDYEKPETPEC\T.KTNLSSVSDCVQQVVELLQEQNIVP 

HTTIKGIHELFWENKVDQIRAEAETLPSLPITKLDLQWVQILSEGWATPLKGFMREK 

EYLQTLHFDTLLDDGVINMSIPIVLPVSADDKARLEGCSKFALMYEGRRVALLQDPEF 

YEFlRKEERCSRVWGTATAKHPFnKMVIVIESGDWLVGGDLQVLERIRWDDGLDQYRLTP 

L 

ELKQKCKDMNADAVFAFQLRNPVHNGHALLMQDTRRRLLERGYKHPVLLLHPLGGWT 
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K 

DDDWLEWRMKQHAAVLEERV^ 

YIVGRDPAGMPHPETKKDLYEPTHGGKVLSMAPGLTSVEIIPFRVAAW 
DPARHEEFDFISGTRMRKLAREGEDPPDGFMAPKAWKVLTDYYRSEMDKTN^ ID. 

NO.:8). 

This polypeptide sequence is encoded for example by (SEQ. ED. NO.: 10), 
but is also encoded by other degenerate coding sequences. 

In one embodiment, the Papss2 protein forms part of a Papss2 fusion 
protein along with a second predetermined polypeptide segment, which can have an 
amino acid sequence related or unrelated to (SEQ. ID. NO.:8). The Papss2 fusion 
protein includes a first Papss2 polypeptide segment comprising an amino acid sequence 
of (SEQ. ID. NO.: 8) or a gene-specific antibody binding fragment thereof at least 6 amino 
acids long; and a second predetermined polypeptide segment. As above, the fusion 
protein is produced by known recombinant methods, and the second predetermined 
polypeptide sequence can be any desired polypeptide, for example, a known antigenic 
sequence, such as a specific goat, rabbit, or HLA antigen to which immunoreagents of 
commercially available immunoassay systems are directed. Thus, in some applications, 
conventional immunoassay systems can be used "off the shelf to detect and quantify a 
Papss2 fusion protein by detecting the second polypeptide sequence. 

The present invention also relates to an isolated antibody or antibody 
fragment that selectively binds a PAPS synthetase protein having an amino acid 
sequence of (SEQ. ID. NO.:7) or (SEQ.ID. NO.8), or selectively binds an antibody 
binding fragment of either of these at least 6 amino acids long. These are useful for 
detection and visualization of PAPSS2 or Papss2 proteins in various applications. The 
antibody or antibody fragment is raised against the PAPSS2 or Papss2 protein of the 
present invention. The antibody is raised in a mammal, from which it is isolated in 
serum, or is most preferably produced in a hybridoma cell line, by known methods, and is 
a monoclonal antibody. Antibody fragments of the present antibody are also useful; 
these include Fab 5 , F(ab')2, or F(v) fragments that are produced by known methods, 
typically by pepsinization of whole antibody. 

28 

LAI 351813vl 



Patent 



18810-81552 



The present invention also relates to a method of diagnosing 
spondyloepimetaphyseal dysplasia in a human subject. The method involves amplifying 
a nucleic acid segment from a sample. The sample is derived from a human subject 
having at least one symptom of spondyloepimetaphyseal dysplasia, and the sample is of a 
bodily substance containing human nucleic acid of the subject. The nucleic acid segment 
defines a sequence from human chromosomal region 10q23-24, between microsatellite 
markers D10S1143 and D10S2470, to produce amplification products; and analyzing the 
amplification products for the presence of homozygosity for a variant allele of a PAPSS2 
gene, the presence of homozygosity for the variant allele of the gene corroborating a 
diagnosis of spondyloepimetaphyseal dysplasia in the human subject. 

Any bodily substance containing human nucleic acids from the subject 
may be sampled, collected, and/or transported for the purpose of practicing the method. 
A most preferred and convenient substance for sampling is blood. However, a sample of 
hair root, urine, amniotic fluid, chorionic villus biopsy, skin biopsy, vascular or oral 
epithelium, spinal fluid, or other biopsy sample of any tissue is also useful. The sample is 
derived from cultured human cells, cell-free extracts, or other specimens indirectly 
derived from a subject's body, as well as from substances taken directly from a subject's 
body. Samples may be stored before amplification and/or analysis by well known storage 
means that will preserve nucleic acids in an analyzable condition, such as quick freezing, 
or a controlled freezing regime, in the presence of a cryoprotectant, for example, dimethyl 
sulfoxide (DMSO), glycerol, or propanediol-sucrose. Samples may also be pooled before 
or after storage for purposes of amplifying their nucleic acids for analysis. 

For purposes of the present invention "amplifying a nucleic acid segment " 
from a tissue sample of a subject "to obtain amplification products" encompasses any 
conventional means of amassing sufficient nucleic acid material for analysis. This 
includes, but is not limited to, amplification by conventional polymerase chain reaction 
methods (e.g., PCR, including RT-PCR). Alternatively, the practitioner can amplify the 
human subject's nucleic acids by in vitro cell culture and harvest of the subject's cultured 
cells, or by multiple sampling from the subject's tissues in vivo and pooling of multiple 
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tissue samples from a subject. Nucleic acids thus "amplified/' if they comprise a 
PAPSS2 gene or a gene-specific part thereof, are "amplification products" for purposes of 
the present invention. 

In a most preferred embodiment of the present method, PCR is used for 
amplifying the subject's nucleic acids, particularly from a PAPSS2 gene. If PCR is used, 
any combination of oligonucleotide primers that will amplify nucleic acid sequences of a 
PAPSS2 gene, or parts thereof, can be employed. Most preferably, amplification of the 
subject's nucleic acids can be achieved using P^SS^-specific oligonucleotide primers 
and primer pairs of the present invention, as described above. For example, useful 
primers comprise a nucleotide sequence of (SEQ. ID. NOS.:3-6, 11-18, or 28) or a 
P^P&S^-specific fragment of any of these at least 15 nucleotides long. In addition a 
sequence complementary to any of (SEQ. ID. NOS.:3-6, 11-18, or 28) or the PAPSS2- 
specific fragment, or any P^P&S^-specific nucleotide sequence at least 15 nucleotides 
long and overlapping at 5 or more contiguous nucleotide positions any of (SEQ. ID. 
NOS.:3-6, 11-18, or 28) at its 5' or 3' end, is a useful primer sequence. Examples of 
pairs of oligonucleotide primers useful for practicing the method include primer pairs 
comprising (SEQ. ID. NOS.:3 and 4) or (SEQ. ID. NOS.:5 and 6), but any of the present 
PAPSS2-specxfic primer pairs are useful. 

In a preferred embodiment of the method of the present invention, 
nucleotide sequencing of PAPSS2 can be used to analyze the amplification products of 
the nucleic acids in a human subject's tissue sample to detect a variant allele in a PAPSS2 
gene. A variant allele of PAPSS2 can be detected by any nucleotide sequencing means, 
for example conventional dideoxy sequencing or preferably by using a commercially 
available automated sequencer, then by comparing the subject's nucleotide sequences to 
other known human PAPSS2 sequences available in genomic sequence databases, such as 
GenBank. 

In a most preferred embodiment of the present method that employs 
nucleotide sequencing, sequencing of PAPSS2 may be accomplished by using 
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fluorescence-based single strand conformation polymorphism analysis (SSCP), a routine 
and reliable means of identifying point mutations, small insertions or deletions, (IS, 
Ellison, Fluorescence-based mutation detection. Single-strand conformation 
polymorphism analysis [F-SSCP], MoL Biotechnol 5(1): 17-31 [1996]; H. Iwahana et 
al. t Multiple fluorescence-based PCR-SSCP analysis using internal fluorescent labeling 
of PGR products, Biotechniques 21(3):510-14, 516-19 [1996]; R. Makino et al, F-SSCP: 
fluorescence-based polymerase chain reaction-single-strand conformation polymorphism 
[PCR-SSCP], PCR Methods Appl. 2(1): 10-13 [1992]). An automated system may be 
used, such as an Applied Biosystems DNA sequencer, equipped with GENESCAN 672, 
Genetyper, or another appropriate analytical software package. 

Optionally, high throughput analysis is achieved by PCR multiplexing 
techniques well known in the art. (E.g., Z. Lin et al, Multiplex genotype determination at 
a large number of gene loci, Proc. Natl. Acad. Sci. USA 93(6):2582-87 [1996]). 

In another preferred embodiment of the present method, nucleotide 
sequencing is unnecessary for analyzing the amplification products. For example, 
heteroduplex analysis on high resolution gel matrices can be employed by the skilled 
practitioner to detect even single nucleotide polymorphisms. (M.T. Hauser et al, 
Generation of co-dominant PCR-based markers by duplex analysis on high resolution 
gels, Plant. J. 16(1): 117-25 [1998]). The PCR/OLA procedure can be used for analyzing 
amplification products to detect SNPs in PAPSS2. (B.R. Glick and J. J. Pasternak, 
Molecular Biotechnology: Principles and Applications of Recombinant DNA, ASM Press, 
Washington, D.C., pp. 197-200 [1994]). Conformation-sensitive gel electrophoresis of 
amplification products can also be employed as a means of analysis by the skilled artisan 
in practicing the methods of the present invention. (A. Markoff et aL, Comparison of 
conformation-sensitive gel electrophoresis and single strand conformation polymorphism 
analysis for detection of mutations in the BRCA1 gene using optimized conformation 
analysis protocols, Eur. J. Genet. 6(2): 145-50 [1998]). 
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A skilled practitioner will know that electrophoresis for analyzing 
amplification products is done rapidly and with high sensitivity by using any of various 
methods of conventional slab or capillary electrophoresis, with which the practitioner can 
optionally choose to employ any facilitating means of nucleic acid fragment detection, 
including, but not limited to, the use of radionuclides, biotin, UV-absorbance or laser- 
induced fluorescence. (K. Keparnik et al, Fast detection of a (CA)18 microsatellite 
repeat in the Ig E receptor gene by capillary electrophoresis with laser-induced 
fluorescence detection, Electrophoresis 19(2);249-55 [1998]; H. Inoue et al, Enhanced 
separation of DNA sequencing products by capillary electrophoresis using a stepwise 
gradient of electric field strength, J. Chromatogr. A. 802(1): 179-84 [1998]; N.J. Dovichi, 
DNA sequencing by capillary electrophoresis, Electrophoresis 18(12-13):2393-99 [1997]; 
H. Arakawa et ah, Analysis of single-strand conformation polymorphisms by capillary 
electrophoresis with laser induced fluorescence detection, J. Pharm. Biomed. Anal. 15(9- 
10): 1537-44 [1997]; Y. Baba, Analysis of disease-causing genes and DNA-based drugs 
by capillary electrophoresis. Towards DNA diagnosis and gene therapy for human 
diseases, J. Chromatgr B. Biomed. Appl 6 87(2): 27 1-3 02 [1996]; K.C. Chan et al, High- 
speed electrophoretic separation of DNA fragments using a short capillary, J. 
Chromatogr B. Biomed. Sci. Appl. 695(1):13-15 [1997]). 

Any of diverse fluorescent dyes are optionally used to label primers or 
amplification products for ease of analysis, including but not limited to, fluorescein, 
SYBR Green I, YO-PRO-1, thiazole orange, Hex (i.e., 6-carboxy-2',4',7',4,7- 
hexachlorofluoroscein), FAM (i.e., 6-carboxyfluorescein), or TET (i.e., 4,7,2',7- 
tetrachloro-6-carboxyfluoroscein). (E.g., J. Skeidsvoll and P.M. Ueland, Analysis of 
double-stranded DNA by capillary electrophoresis with laser-induced fluorescence 
detection using the monomeric dye SYBR green I, Anal Biochem. 23 1(20):359-65 [1995]; 
H. Iwahana et al, Multiple fluorescence-based PCR-SSCP analysis using internal 
fluorescent labeling of PCR products, Biotechniques 21(30:510-14, 516-19 [1996]). 

Analyzing the amplification products can also be done by means of 
restricting the amplification products with one or more restriction enzymes, for example 



LAI 351S13vl 



32 



Patent 



18810-81552 



Hindi, followed by separation of the resulting fragments and analysis of fragment length 
or differential fragment migration in denaturing high-performance liquid chromatography 
(DHPLC) or gel electrophoresis, as above, including restriction-capillary electrophoresis. 
For example, this may be achieved by techniques known in the art, such as PCR- 
restriction fragment-SSCP, which can detect single base substitutions, deletions or 
insertions. (M. Tawata et ah y A mass screening device of genome by polymerase chain 
reaction-restriction fragment-single strand conformation polymorphism analysis, Genet- 
Anal. 12(3-4): 125-27 [1996]; H.H. Lee et ah, Mutational analysis by a combined 
application of the multiple restriction fragment-single strand conformation polymorphism 
and the direct linear amplification DNA sequencing protocols, Anal Biochem. 
205(2);289-93 [1992]). 

Variant alleles of a PAPSS2 gene may be of any nucleotide sequence 
defining an open reading frame, or part thereof, that encodes PAPSS2 protein, or of any 
nucleotide sequence, or part thereof, defining a regulatory region of a PAPSS2 gene. A 
PAPSS2 regulatory region may include, but is not limited to, a promoter region, an 
enhancer, a termination sequence, or any other nucleotide sequence, or part thereof, that 
regulates the transcription of PAPSS2 protein. Any mutation or polymorphism in a 
PAPSS2 gene that results in a loss of PAPSS2 expression or a decrease in PAPSS2 
functional activity relative to a normal activity range, or which could do so if it were not 
masked by the presence in vivo of a fully functional PAPSS2 allele or by another 
compensatory physiologic mechanism, are contemplated by the present invention. 

For example, one variant allele of PAPSS2, characteristic of SEMD 
Pakistani-type, defines a stop codon instead of a serine codon corresponding to amino 
acid residue 475 of SEQ. ID. NO.: 7, and results in synthesis of a truncated PAPSS2 
protein 474 amino acids long. Homozygosity for this variant allele is associated with 
SEMD Pakistani-type in a human subject, and thus corroborates a diagnosis of SEMD 
Pakistani type. 



LAI 351813vl 



33 



Patent 18810-81552 

However, detecting heterozygosity for a variant PAPSS2 allele is also 
useful, for genetic counseling purposes. Consequently, the present invention also relates 
to a method of identifying a human carrier of an heritable allele associated with 
spondyloepimetaphyseal dysplasia. As in the present method of diagnosing 
spondyloepimetaphyseal dysplasia, the method of identifying a carrier involves 
amplifying a nucleic acid segment from a sample derived from a human subject without 
symptoms of spondyloepimetaphyseal dysplasia, to produce amplification products. As 
described above, the sample is of a bodily substance containing human nucleic acid of the 
subject. The nucleic acid segment defines a sequence from human chromosomal region 
10q23-24, between microsatellite markers D10S1 143 and D10S2470. The amplification 
products are analyzed for the presence of a variant allele of a PAPSS2 gene, which is 
accomplished as described above. The presence of the variant allele of the gene 
identifies the human subject as a carrier of an heritable allele associated with 
spondyloepimetaphyseal dysplasia. A benefit of the present method of identifying a 
human carrier of an heritable allele associated with SEMD, is that couples in which one 
or both of the spouses is identified as a carrier of a SEMD-associated allele, can make 
more informed reproductive decisions. 

The present invention also relates to a gene therapy method for treating a 
human subject having an osteoarthritic disorder that is caused or aggravated by deficient 
enzymatic sulfation activity. Examples include spondyloepimetaphyseal dysplasia, 
Stickler syndrome, spondyloepiphyseal dysplasia, and achondrogenesis; achondroplasia; 
chondrodysplasia; diastrophic dysplasia; the collagen oligomeric matrix protein (COMP) 
disorders such as pseudoachondroplasia and multiple epiphyseal dysplasia. In one 
embodiment, the method involves exposing, in vitro, one or more cells derived directly 
(e.g., biopsied) or indirectly (e.g., cultured) from a tissue of a human subject having at 
least one symptom of an osteoarthritic disorder caused or aggravated by deficient 
enzymatic sulfation activity to a gene delivery mixture comprising at least one gene 
delivery agent and a nucleic acid construct having a nucleic acid segment encoding a 
PAPS synthetase protein having an amino acid sequence of (SEQ. ID. NO.:7) or (SEQ. 
ID. NO.: 8); causing the nucleic acid construct to be taken up by, and released into the 
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cell(s), such that the nucleic acid construct is incorporated into the genome of the cell(s), 
whereby under appropriate physiological conditions expression of functional PAPS 
synthetase protein is enhanced in the cell(s), compared to unmodified cells of the same 
kind. The genetically modified cells are then implanted into the tissue of the human 
subject, where PAPS synthetase activity is thereby enhanced relative to pre-implantation, 
and at least one symptom associated with the osteoarthritic disorder in the subject is 
improved. 

For purposes of treating symptoms of aberrant skeletal growth, the human 
subject is preferably a juvenile, and most preferably a prepubescent juvenile. However, 
for purposes of treating symptoms of joint degeneration associated with osteoarthritic 
disorders, such as SEMD or chondrodysplasia, and for stabilizing the cartilage matrix, the 
human subject is any age. In one embodiment, the gene therapy method for treating an 
osteoarthritic disorder caused or aggravated by deficient enzymatic sulfation activity is 
directed to symptom(s) related to deficient sulfation activity that may arise in liver cells, 
e.g., hepatocytes, in platelets, or in vascular tissue, and these symptoms are treated in 
accordance with the method at any age, depending only on the practitioner's best 
estimation of the physiologic resilience of the individual patient in facing the stresses of 
the procedure. 

These stresses are typically minimal, as cells can be obtained by 
pertcutaneous or laparoscopic biopsy. However, cells are also obtained by any other 
suitable surgical means or from a pre-existing culture derived from a tissue of the subject 
or other viable sample of the subject's cells. Implantation of the genetically modified 
cells is accomplished by conventional percutaneous, laparoscopic, stereotactic, or other 
surgical methods. 

The method involves genetically modifying a cell(s) derived from a tissue 
of the subject. Preferably, but not necessarily, this is a cartilage-forming cell, such as a 
chondrocyte from an epiphyseal region. However, other cell types can also be genetically 
modified in accordance with the method to improve osteoarthritic symptoms that may be 
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associated with other tissues, for example hepatocytes, platelets, or vascular endothelium. 
The cells, for example, chondrocytes, hepatocytes, hematopoietic precursor cells, or 
vascular endothelial cells, are obtained from an appropriate tissue, such as the epiphyses, 
joints, liver, bone marrow, or vascular tissue. 

The cell(s) are genetically modified. This can be by any suitable gene 
delivery method as described above, preferably in vitro, for example by transfection, 
transduction, or electroporation, but in vivo methods of gene delivery are also 
contemplated. The cell(s) is exposed to a gene delivery mixture comprising at least one 
gene delivery agent, as described above, and a nucleic acid construct, preferably DNA, 
containing a nucleic acid segment encoding a PAPS synthetase protein having an amino 
acid sequence of (SEQ. ID. NO.:7), for example (SEQ. ID. NO.:9), or containing a 
nucleic acid segment encoding a PAPS synthetase protein having an amino acid sequence 
of (SEQ. ID. NO.:8), for example (SEQ. ID. NO.: 10), operatively linked, in a 
transcriptional unit, to a predetermined promoter sequence that operates in the cell. 

The nucleic acid construct that is used in the present gene therapy method 
optionally contains a reporter gene for convenient detection, isolation or selection of 
genetically modified cells expressing PAP synthetase protein from an exogenously 
supplied gene, as described above. 

The construct is then taken up by and released into the cell, and is 
incorporated into the genome of the cell. Under appropriate physiological conditions, 
transcription of the PAPSS2 or Papss2 encoding region occurs, followed by translation, 
and thus, expression of PAPSS2 or Papss2 protein is enhanced in the cell, compared to 
unmodified cells of the same kind. With the cell(s) implanted in the tissue of the subject, 
one or more symptoms of the osteoarthritic disorder in the subject are thereby improved 
due to enhanced enzymatic sulfation in the tissue of biochemicals that require sulfation 
for bioactivity or have increased bioactivity in humans in a sulfated form. 
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Under some circumstances, the practitioner may deem it desirable to apply 
the present gene therapy method to a fetal subject that has been identified as being 
homozygous for a variant allele of a PAPSS2 gene by the present method of diagnosing 
spondyloepimetaphyseal dysplasia. In such instances, chondrocytes and stem cell 
precursors of chondrocytes can be obtained from fetal cartilage, can be genetically 
modified and re-implanted while the subject is in utero. 

Another embodiment of the gene therapy method of the present invention 
involves a gene modification strategy employing a chimeric RNA/DNA oligonucleotide 
construct, particularly useful in repairing SNPs associated with a SEMD phenotype, such 
as the allele encoding a stop codon instead of serine at position 475 of (SEQ. ID. NO.:7) 
(e.g., "TAA" instead of "TCA" at nt. +1423 through +1425 of SEQ. ID. NO.:l; see 
Figure 3), associated with SEMD Pakistani type. (E.g., Kren, B.T., et al, Gene repair 
using chimeric RNA/DNA oligonucleotides, Semin. Liver Dis. 19(1):93-104 [1999]; 
Alexeev, V. & Yoon, K., Stable and inheritable changes in genotype and phenotype of 
albino melanocytes induced by an RNA-DNA oligonucleotide ; Nat. Biotechnol 
16(13): 1343-46 [1998]; Cole-Strauss, A., et al, Targeted gene repair directed by the 
chimeric RNA/DNA oligonucleotide in a mammalian cell free extract, Nucleic Acids Res. 
27(5): 1323-30 [1999]; Xiang, Y., et al., Targeted gene conversion in a mammalian 
CD 34 + -enriched cell population using a chimeric RNA/DNA oligonucleotide, J. Mol. 
Med. 75(ll-12):829-35 [1997]; Cole-Strauss, A., et al, Correction of the mutation 
responsible for sickle cell anemia by an RNA-DNA oligonucleotide, Science 
273(5280): 1386-89 [1996]; Yoon, K, et al, Targeted gene correction of episomal DNA 
in mammalian cells mediated by a chimeric RNA.DNA oligonucleotide, Proc. Natl. Acad. 
Sci. USA 93(5):2071-76 [1996]). Briefly, this gene modification strategy exploits the 
recombinogenicity of RNA-DNA hybrids and features the use of chimeric RNA/DNA 
oligonucleotides with hairpin capped ends that avoid destabilization or destruction by 
cellular helicases or exonucleases; an RNA/DNA oligonucleotide is designed so that it 
perfectly aligns with a specific genomic target sequence except for a single mismatched 
basepair, which is recognized by endogenous DNA repair systems that can then convert 
the SNP to a desired normal sequence. 
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In this embodiment, the cells are exposed to a nucleic acid construct that 
includes a chimeric RNA/DNA oligonucleotide. This chimeric RNA/DNA comprises a 
nucleic acid segment of a PAPSS2 gene comprising a nucleotide sequence of (SEQ. ID. 
NO.:9), (SEQ. ID. NO.: 10), or a gene-specific fragment of either of these. The nucleic 
acid construct is taken up by, and released into, the cell, and the nucleotide sequence of 
the nucleic acid segment is incorporated into the genome of the cell, whereby expression 
of functional PAPS synthetase protein is enhanced in the cell. Under appropriate 
physiological conditions, transcription of the PAPSS2 or Papss2 encoding region occurs, 
followed by translation, and thus, expression of PAPS synthetase protein is enhanced in 
the cell, compared to unmodified cells of the same kind. The genetically modified cell(s) 
is then implanted into the tissue of the human subject, as described above. With the 
cell(s) implanted in the tissue of the subject, one or more symptoms of the osteoarthritic 
disorder in the subject are thereby improved due to enhanced enzymatic sulfation in the 
tissue of biochemicals that require sulfation for bioactivity or have increased bioactivity 
in humans in a sulfated form. 

Thus, the gene therapy method provides a much needed way to improve or 
eliminate symptoms of aberrant skeletal growth or joint degeneration associated with 
osteoarthritic disorders such as SEMD. The method provides a way to improve 
symptoms resulting from deficient sulfation activity in other cell types, such as 
hepatocytes, platelets, or vascular epithelium, for example, abnormally long bleeding 
time. 

The present invention is also related to a protein therapy method for 
treating a human subject having an osteoarthritic disorder. The osteoarthritic disorder is 
one that is caused or aggravated by deficient enzymatic sulfation activity, as described 
above. The method employs the inventive PAPSS2 fusion protein. The method involves 
exposing a cell(s) of a tissue of a human subject having the osteoarthritic disorder to a 
fusion protein comprising a first PAPSS2 polypeptide segment that comprises an amino 
acid sequence of (SEQ. ID. NO.:7), or an enzymatically active fragment thereof, and a 
second polypeptide segment capable of infiltrating the cell, whereby the fusion protein is 
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taken up by the cell and the PAPSS2 polypeptide segment is enzymatically active within 
the cell. Exposing the cell(s) is preferably accomplished in vivo, for example by injecting 
or infusing the fusion protein into the subject. An example of a second polypeptide 
segment capable of infiltrating a diversity of cell types is an human immunodeficiency 
virus (HIV) TAT protein. (Schwarze, S.R., et al [1999]). But a tissue-specific second 
polypeptide can also be used. 

The skilled artisan is aware of suitable means of detecting an improvement 
in the subject's symptom(s) resulting from treatment by the inventive genetic or protein 
therapy methods. For example, a useful indicator of improvement in symptoms of 
aberrant skeletal growth is a comparison of the subject's growth rate over a period of 
months or years after treatment using the method, with the rate of growth before 
treatment. It is also useful to compare the subject's growth over time after treatment with 
standard growth curves or tables for humans of the same age and sex. Improvement in 
joint function, for example reduction in joint pain or enhanced joint movement, or joint 
appearance on x-ray, are useful indicators. Another useful indicator is improvement in 
platelet function, which is detected in normalized or shortened bleeding time for the 
subject. 

Alternatively, biochemical or molecular methods are employed to detect 
improvement. For example, assays showing increased sulfation activity in a tissue biopsy 
sample(s) from the subject, such as in cartilage, liver, or blood samples and/or extracts, 
are useful in determining the effectiveness of the method. Commonly, such assays are 
based on direct measurement of in vivo and/or in vitro PAPS synthetase enzymatic 
activity or detecting the assimilation of sulfate into APS (i.e., sulfate activation assay) or 
into other sulfated metabolites, for example measuring the degree of sulfation of aggrecan 
side chains. Radiolabeled (e.g., 35 S) sulfate and chromatographic separations are 
common tools used in such assay methods. Detection of PAPSS2-specific mRNA 
transcripts in tissue samples, using the probes or primers of the present invention, can 
also be employed as an indirect measure of improvement. 
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Alternatively, the nucleic acid construct that is used in the present gene 
therapy method can optionally contain in the transcriptional unit a reporter gene, as 
described above, for convenient detection of genetically modified cells expressing 
exogenous PAPSS2 in a tissue biopsy sample(s) from the subject. 

The present invention is also related to a genetic testing kit for diagnosing 
SEMD in a human subject. The kit is an assemblage of materials for facilitating 
amplifying nucleic acids from a human subject comprising PAPSS2 nucleotide sequences 
and/or analyzing PAPSS2-&vxivq& amplification products. A genetic testing kit of the 
present invention comprises at least one oligonucleotide primer of the present invention 
and preferably comprises a pair of oligonucleotide primers of the present invention, 
together with instructions for the practice of the methods of the present invention. A 
most preferred embodiment of the genetic testing kit of the present invention comprises 
an inventive pair of oligonucleotide primers, as described above, for example (SEQ. ID. 
NOS.:3 and 4) or (SEQ. ID. NOS.:5 and 6). A preferred embodiment of the genetic 
testing kit of the present invention can incorporate an array of oligonucleotide primers 
specific for SNPs in human PAPSS2, preassembled in a "DNA chip" (or "gene chip") 
configuration for facilitating the amplifying of nucleic acids and the analyzing of 
amplification products. (E.g., J.G.Hacia et al, Enhanced high density oligonucleotide 
array-based sequence analysis using modified nucleoside triphosphates, Nucleic Acids 
Res. 26(2):4975-82 [1998]; R.W. Wallace, DNA on a chip: serving up the genome for 
diagnostics and research, Mol. Med. Today 3(9):384-89 [1997]; T. Pastinen et al, 
Minisequencing: a specific tool for DNA analysis and diagnostics on oligonucleotide 
arrays, Genome Res. 7(6); 606- 14 [1997], M.T. Cronin et al, Cystic fibrosis mutation 
detection by hybridization to light-generated DNA probe arrays, Hum. Mutat. 7(3):244- 
55 [1996]; A.C. Pease et al, Light-generated oligonucleotide arrays for rapid DNA 
sequence analysis, Proc. Natl. Acad. Sci. USA 91(ll):5022-26 [1994]; E M. Southern et 
al, Arrays of complementary oligonucleotides for analyzing the hybridisation behaviour 
of nucleic acids, Nucleic Acids Res. 22(8): 1368-73 [1994]). 
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The present invention also relates to a kit for genetically modifying a 
vertebrate cell. The kit is a ready assemblage of materials or components for facilitating 
the genetic modification of a vertebrate cell. The kit includes a polynucleotide 
comprising a PAPSS2 sequence having SEQ. ID. NOS.:l, or a Papss2 sequence having 
SEQ. ID. NO.:2, a sequence complementary to either of these, or a degenerate coding 
sequence thereof (i.e., encoding SEQ. ID. NOS.:7 or 8), or a gene-specific fragment of 
any of these, as described herein with respect to the nucleic acid construct of the present 
invention. Preferably the polynucleotide includes a transcriptional unit that contains the 
human PAPSS2- or murine Papss2-encoding nucleic acid segment, operatively linked to a 
functional promoter sequence, and/or a reporter gene for facilitating detection, isolation, 
or selection of genetically modified cells from unmodified cells. The kit optionally 
contains a suitable gene delivery agent. The kit is particularly useful in practicing the 
gene therapy method for treating a human subject having an osteoarthritic disorder that is 
caused or aggravated by deficient enzymatic sulfation activity. 

However, in several embodiments, the kit includes a DNA sequence 
encoding the PAPSS2 or Papss2 protein, or fragment thereof at least 6 amino acids long, 
in a sense or antisense orientation, as appropriate for a particular application. 
Alternatively, the kit includes a PAPSS2- or Papss2 -specific chimeric RNA/DNA 
oligonucleotide, as described above. Some embodiments of the kit are configured for use 
in practicing the present method of treating a human subject having an osteoarthritic 
disorder, as described above. 

The present invention also relates to a kit for practicing the inventive 
method of protein therapy for treating a human subject having an osteoarthritic disorder 
that employs the inventive PAPSS2 fusion protein, as described above. The kit contains a 
fusion protein comprising a first PAPSS2 polypeptide segment that comprises an amino 
acid sequence of (SEQ. ID. NO.: 7), or an enzymatically active fragment thereof, and a 
second polypeptide segment capable of infiltrating the cell, whereby the fusion protein is 
taken up by the cell and the PAPSS2 polypeptide segment is enzymatically active within 
the cell. An example of a second polypeptide segment capable of infiltrating cell(s) is an 
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human immunodeficiency virus (HIV) TAT protein, which can infiltrate diverse cell 
types, but other polypeptide segments are also useful. 



The materials or components assembled in the present kits for diagnosing 
SEMD or identifying a carrier of a heritable marker associated with SEMD and kits for 
genetically modifying a vertebrate cell or for protein therapy can be provided to the 
practitioner stored in any convenient and suitable ways that preserve their operability and 
utility. For example the components can be in dissolved, dehydrated, or lyophilized 
form; they can be provided at room, refrigerated or frozen temperatures. The kits of the 
present invention include instructions for using the materials or components effectively 
for their intended purpose(s). 

It is contemplated that the polynucleotides, nucleic acid constructs, 
primers, primer pairs, and kits of the present invention can be used therapeutically to 
detect SEMD in patients with specific variant alleles of PAPSS2. These variant alleles 
can be targeted for gene therapy directed at repairing malfunctional PAPSS2 alleles or 
they may suggest specific chemotherapy directed to enhancing endogenous PAPS 
synthetase activity or may be directed to other metabolic targets in biochemical pathways 
mediated by PAPS synthetase activity. 

The foregoing descriptions of the isolated polynucleotides, nucleic acid 
constructs, genetically modified cells and vertebrates, methods, and kits of the present 
invention are illustrative and by no means exhaustive. The invention will now be 
described in greater detail by reference to the following non-limiting examples. 

EXAMPLES 

Example 1 : Methods . 

Genotvping . Synthetic oligonucleotide primers were obtained for 
genotyping in the SEMD family with polymorphic microsatellite markers from the CHLC 
Human Screening Set/Weber version 8 (Research Genetics). Additional markers were 
derived from a physical map of chromosome 10 (Gray, I.C., et aL 7 An integrated physical 
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and genetic map spanning chromosome band I0q24, Genomics 43:85-88 [1997]) and 
databases from the Whitehead Institute for Biomedical Research, the Cooperative Human 
Linkage Center, and Genethon. 

Similar methods were used for analysis of microsatellite markers in the 
brachymorphic mice, using DNA samples from backcross animals recombinant in the bm 
region. (Rusiniak, M.E., et al. y Molecular markers near the mouse brachymorphic (bm) 
gene, which affects connective tissues and bleeding time, Mamm. Genome 7:98-102 
[1996]). Markers were derived from the Whitehead Institute for Biomedical Research 
mouse genetic and physical mapping database (http://www-genome.wi.mit.edu/cgi- 
bin/ mouse/index) . 

Linkage Calculations . Two-point linkage analyses were conducted for 
eight markers in the region of D10S1143 (see Figures 1 and 2 and Table 1). LOD scores 
were calculated with the MLINK subroutine of the LINKAGE 5.1 package (Terwilliger, 
J.D. & Ott, J., Handbook of human genetic linkage, Baltimore: John Hopkins University 
Press [1994]) using a Pentium PC. Because of the computational problems introduced by 
extensive inbreeding and multiple generations with no genotyping or phenotyping data, 
the SEMD pedigree was divided into three separate sub-pedigrees. Individuals in the 
three nuclear families from the most recent generation, including each of their ancestors 
back to the founding couple as well as all inbreeding loops, were analyzed separately. 
The genotypes of each individual were analyzed only once. 

SEMD was modeled as an autosomal recessive fully penetrant disease with 
an allele frequency of .0001. LOD scores were summed across the pedigrees. The LOD 
scores of -« reflect recombinations within nuclear families for markers D10S2470 and 
D10S1753. Other markers did not recombine within the nuclear families, so that q = 0.0 
could not be ruled out for them, although parsimony indicates that recombination has 
probably occurred within the pedigree for each of these markers. The maximum LOD 
scores for markers D10S1143 and D10S2470 at q=.01 were consistent with the results of 
the haplotype analysis. Because marker allele frequencies in the Pakistani population are 
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unknown, for each marker, the allele frequencies were set at 1/N, where N was the 
number of observed alleles in the pedigree. 



Radiation Hybrid Mapping . DNA from the Genebridge 4 Radiation 
Hybrid (RH) panel was obtained from Research Genetics. For the PAPSS2 gene, a 236 
bp PCR product was amplified using primers 5 ' -CTGGTGCTGGAAAAAC AACG-3 ' 
(SEQ. ID. NO.:5; forward primer) and 5 ' -TGCGAATGGAGAA AT AAAGCTG-3 ' 
(SEQ. ID. NO :6; reverse primer). RH data were submitted to the Whitehead server 
(http://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl) for placement on the RH 
map. 

Computer Methods . BLAST searches (Altschul, S.F.,e/ a/., Gapped 
BLAST and PSI-BLAST: a new generation of protein search programs, Nucleic Acids 
Res. 25:3389-3402 [1997]) and determinations of sequence homology were performed 
using the National Center for Biotechnology Information 
(http://www.ncbi.nlm.nih.gov/BLAST/) and Baylor College of Medicine 
(http://dotimgen.bcm.tmc.edu:933 l/seq-search/nucleic_acid-search.html) servers. The 
phenoytpic map of the mouse genome was accessed through the Mouse Genome 
Informatics server (http://www.informatics.jax.org), maintained by the Jackson 
Laboratory. 

cDNA library analysis . To isolate the human PAPSS2 cDNA clones, 
oligonucleotide primers B (SEQ. ID. NO.: 12, below) and E (SEQ. ID. N0.:15, below) 
were designed from the EST sequences identified by BLAST searching to amplify an 826 
bp cDNA fragment by RT-PCR from human fetal cartilage RNA. The fragment was 
hybridized to plaques from a directional human fetal cartilage cDNA library constructed 
by Stratagene in the lambda ZAP2 vector (D. Krakow and D. Cohn, unpublished). 
Alternatively, a forward primer (SEQ. ID. NO.: 11, below) and a reverse primer 5'- 
GC ATGTCC AGAC AGAC ACC AC-3 ' (SEQ. ID. NO.:28) were used to produce, from 
fetal cartilage RNA, a cDNA sequence 1940 base pairs long, including the entire PAPSS2 
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coding region and flanking regions, which nucleic acid segment was then used as a 
hybridization probe of the fetal cartilage cDNA library. 

Primers used for sequence analysis of the cDNA clones were: 

(A) 5 ' -GCC AGCC AGC ATGTCGGGGAT-3 '(SEQ. ID. NO.:ll; forward primer); 
5 (B) 5 ' -ACCTGAAACTCCTGAGCGTGTGCT-3 ' (SEQ. ID. NO. : 12; forward primer); 

(C) 5 ' -GATGTGCCTCTAGACTGGCGG-3 ' (SEQ. ID. NO.:13; forward primer); 

(D) 5'-GAGCACTTCAGAAAGAAACTCTGG-3'(SEQ. ID. NO.: 14; reverse primer); 

(E) 5'-CATCCGCCAGTCTAGAGGCAC-3' (SEQ. ID. NO.: 15; reverse primer); 

(F) 5 '- AGGTGTC AGACGGT ATTGGTC-3 ' (SEQ. ID. NO.:16; reverse primer); 

10 (G) 5 ' -GTC ACTC ACTGTGGAC AAATTGG-3 '(SEQ. ID. NO.:17; reverse primer); and 
(H) 5'-CACCTCAGCAATCCGGCGGAT-3' (SEQ. ID. NO.: 18; reverse primer). 

The sequence of 3' end of the coding region of the mouse Papss2 cDNA 
was identified by database searching. The 5' end of the cDNA was isolated from a mouse 
brain cDNA library, constructed from the C57BL/6 strain in the Uni-ZAP XR vector 

15 (Stratagene), by amplification from pooled library DNA using a gene specific primer 
derived from the sequence at locus D19MU13 (5 '-TCTGGC AC AAAGAGTTCGTG-3 ' 
[SEQ. ID. NO.: 19; reverse primer]) and a vector-specific primer. To amplify the entire 
coding region of Papss2, oligonucleotide primers from the 5' ([a] 5'- 
GCCAGTTTGTAAC CGAGT ATTC-3 ' [SEQ. ID. NO.:20; forward primer]) and 3' ([b] 

20 5'-GCAATTGGATACAGAGCAGC TA-3' [SEQ. ID. NO.:21; reverse primer]) 
untranslated regions were used. Sequence was determined using the 5' and 3' 
untranslated region primers as well as primers: 

(c) 5 '-GACAATGTCCGTC ATGGCCTT A-3 ' (SEQ. ID. NO.:22; forward primer); 

(d) 5'-ATTCCCATTGTATTGCCCGTT-3' (SEQ. ID. NO.:23; forward primer); 

25 (e) 5'-AACGGGC AATACAATGGGAAT-3 ' (SEQ. ID. NO. :24; reverse primer); and 
(f) 5 ' -GAT AAAGCTGGTGATGC AAACC-3 ' (SEQ. ID. NO.:25; reverse primer). 

Mutation analysis . To define the human mutation, RNA was purified from 

a lymphoblastoid cell line from an affected family member. RT-PCR using either the 

primers from the 5'- and 3'- untranslated regions (primers A and D, above) or internal 

30 primers were used to amplify cDNA fragments containing the entire coding region. 
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Control fragments were amplified from human cartilage RNA. Direct sequence analysis 
of the PCR products using the primers listed above was used to identify the mutation. To 
confirm the mutation and examine segregation in the Pakistani family, we used Hindi 
digestion of a 108 bp genomic DNA fragment amplified with primers 5'-TGGA 
5 CC AAGGATGACGATGT-3 ' (SEQ. ID. NO.:3; forward primer) and 5'- 
CGGAAAGATGGC AAC AATGG-3 ' (SEQ. ID. NO.:4; reverse primer). 

To define the mouse mutation, RNA was purified from frozen spleen from 
the brachymorphic mouse and a PWK control. (Rusiniak et al [1996]). The Papss2 
coding region was amplified in two fragments by RT-PCR using primer pair (a) (SEQ. 
Q 10 ID. NO. :20; forward primer) plus (g) 5'-CATGG GATGGCGTGAGAT AC-3 5 (SEQ. ID. 
gig NO.:26; reverse primer) and primer pair (b) (SEQ. ID. NO.:21; reverse primer) plus (h) 

!g 5 ' -C AT AAGCTTTGCTTT GGAAGAGT-3' (SEQ. ID. NO. :27; forward primer), and the 

i s * sequence was determined using the primers (c)-(f) listed above. 

ro 

: J 3 

L Example 2: Linkage of SEMD Pakistani type to markers on human chromosome 10 . 

N 15 A pooling strategy was used to carry out a genome-wide search for the 

□ 

chromosomal region containing the disease gene in a large inbred Pakistani SEMD family 

,;s3. 

y (Ahmad et aL 9 1998, incorporated by reference). A total of 381 markers were typed, 

ultimately achieving an average marker spacing of about 10 cM. Individual genotypes for 
all studied family members were determined for 83 markers that showed differences in 
20 allele distributions between the affected and unaffected pools. For all but three of these 
markers, D10S1432, D10S2470, and D10S677, linkage to the SEMD phenotype could be 
excluded. Significant linkage was initially obtained for the marker at locus D10S2470, 
yielding a maximum two-point lod score of 5.44 at a recombination fraction of 0.01. 

Saturation genotyping with additional markers from the region confirmed 
25 the initial linkage result. Two-point lod scores for all markers are shown in Table 1, with 
the highest lod score of 5.51 at a recombination fraction of 0.01, obtained for the marker 
at locus D10S1143. 
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Table 1. Linkage Analysis of Chromosome 10 Markers and SEMD. 
LOD Score at q = 



Marker 


0.0 


0.001 


0.01 


0.05 


0.1 


0.2 


0.3 


0.4 


D10S1689 


-0.20 


-0.12 


0.19 


0.45 


0.41 


0.20 


0.10 


0.02 


D10S1717 


2.85 


3.02 


3.52 


3.62 


3.26 


2.39 


1.50 


0.66 


D10S1644 


4.28 


4.44 


4.88 


4.76 


4.12 


2.78 


1.63 


0.67 


D10S1687 


4.23 


4.41 


4.87 


4.77 


4.18 


2.85 


1.66 


0.67 


D10S1739 


2.16 


2.43 


3.02 


3.08 


2.69 


1.81 


1.02 


0.34 


D10S1143 


4.89 


5.06 


5.51 


5.35 


4.66 


3.14 


1.79 


0.71 


D10S2470 


- CO 


4.63 


5.44 


5.31 


4.60 


3.04 


1.67 


0.67 


D10S1753 


- CO 


-0.57 


1.02 


1.80 


1.74 


1.20 


0.66 


0.20 



A proximal boundary at locus D10S1689 was established by the 
haplotypes of three individuals, VT-2, VI-4 and VIII-2. A distal limit at locus D10S2470 
was established by a recombination observed in VTII-15. Together, these recombinant 
events localized the disease gene within a maximum interval of 8.1 cM, flanked on the 
proximal side by the marker at locus D10S1689 and on the distal side by the marker at 
locus D10S2470. The genotype of individual Vl-7 provided clear evidence of 
recombination within the 8.1 cM interval. Compatible with the recessive inheritance 
pattern for the disease, he was homozygous for the marker at only one locus, D10S2470, 
within the maximum interval. Together with the distal boundary of the larger interval 
established by the recombination in individual VIII- 15, the data showed that the disease 
gene was contained within the 1.7 cM separating the markers at loci D10S1143 and 
D10S2470. 

Figure 1 shows a genetic map of the interval containing the SEMD disease 
gene on human chromosome 10. The darker bar identifies the 8.1 cM maximum interval 
between loci D10S1689 and D10S2470. The 1.7 cM minimum interval is identified by 
the darkest bar, between loci D10S1143 and D10S2470. The locations of D10SI432 and 
DJ0S677 are not drawn to scale. 
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Example 3 : A novel ATP sulfurylase/kinase gene in mice and humans . 

The SEMD interval established by linkage corresponded to chromosomal 
region 10q23-24, which is syntenic with the region of mouse chromosome 19 to which 
the recessive mouse skeletal mutant brachymorphic was localized. This regional 
homology implied that the two phenotypes could be due to mutations in an orthologous 
pair of genes. Additional recombination mapping using recombinant mice from a large 
backcross with the brachymorphic mouse showed that only 3 of 25 mice recombinant at 
the previously defined proximal boundary (Rusiniak, M.E., et al, Molecular markers 
near the mouse brachymorphic (bm) gene, which affects connective tissues and bleeding 
time, Mamm. Genome 7:98-102 [1996]), the markers at loci D19MU39 and D19MU57, 
were recombinant at locus D19MU135. The distal limit was not further narrowed by 
recombination mapping, leaving a total estimated interval of about 0.7 cM, flanked by the 
markers at loci D19MU135 and D19MU64, containing the bm mutation. (Figure 2). 

As in prior genetic mapping studies (Rusiniak et al [1996]), the marker at 
locus D19MU13 was the only marker non-recombinant with the brachymorphic 
phenotype and, because of the large size of the cross, was therefore expected to be 
extremely close to the disease gene. A BLAST search using D19MU13 as the query 
sequence was carried out to test the hypothesis idea that the marker might be within an 
intron of the bm gene. Sequences on both sides of the D19MU13 CA repeat element were 
about 80% homologous to a human expressed sequence tag (EST), zc38a08.rl, which 
was distinct from but similar to the known PAPS synthetase gene, Papssl. Comparison 
of the EST sequence with D19MU13 revealed that the CA repeat was contained within a 
180 base pair intron bordered by consensus splice donor and splice acceptor sequences 
(data not shown). Further sequential BLAST searches using both the human PAPSS1 and 
murine Papssl cDNA sequences, as well as EST sequences derived from each search, 
identified sets of overlapping ESTs from both species that could be used to assemble 
partial cDNA sequences. The ESTs were derived from a variety of tissue and cell types, 
including colon, lung, placenta, aortic endothelium, and fetal heart, suggesting a widely 
expressed gene. (Figure 4). 
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To complete the human cDNA sequence, the nucleotide sequences of 
PAPSS2 cDNA clones isolated from a human fetal cartilage cDNA library were 
determined. The reading frame was determined by comparison with the paralogous 
human PAPSS1 and mouse Papssl cDNA sequences. At the 5 5 end, an in-frame ATG 
codon immediately preceded by a sequence that fit the consensus sequence for translation 
initiation (Kozak, M., An analysis of 5'-noncoding sequences from 699 vertebrate 
messenger RNAs, Nucleic Acids Res. 15:8125-48 [1987]) was assumed to represent the 
first translated codon. Conceptual translation of the 615 codon open reading frame 
predicted a molecular weight of 69,448 Da for the unmodified protein. The coding 
regions of the human PAPSS2 and PAP SSI cDNA sequences were 71% identical and the 
primary translation products were 78% identical. Radiation hybrid mapping localized 
PAPSS2 at 519 cR on the chromosome 10 map, between loci AFM225YD12 and 
D10S583, and within the SEMD region defined by linkage. 

Sequential BLAST searching and sequence analysis of PCR products from 
a mouse brain cDNA library was used to complete the coding sequence of the mouse 
Papss2 cDNA. At the 5' end, there were two in-frame ATG codons, each preceded by a 
possible consensus sequence for translation initiation. An in-frame stop codon was 
located just upstream from the first ATG, further supporting the inference that one or both 
of the two ATG codons represent the site(s) of translation initiation. For purposes of 
comparison with the human orthologue, we have assumed that the first ATG is used. The 
open reading frame contains 617 codons and a molecular weight of 69,814 Da was 
calculated for the encoded protein. The coding regions of the human PAPSS2 and mouse 
Papss2 genes were 86% identical at the cDNA level and 93% identical at the protein 
level. 

Example 4: A stop codon in the human PAPSS2 gene in the Pakistani SEMD family . 

The coding region of the human PAPSS2 cDNA was amplified by RT- 
PCR using RNA derived from a lymphoblastoid cell line from an affected member of the 
Pakistani SEMD family. The only sequence change relative to the control fragment, 
amplified from human fetal cartilage RNA, was a point mutation that predicted a stop 
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codon in place of the serine codon for residue 475 of the protein. (Figure 3). The 
mutation thus predicted truncation of the protein from 615 to 474 amino acids. The 
mutation disrupted a Hindi restriction endonuclease cleavage site, and analysis of an 
amplified genomic DNA fragment containing the site of the mutation by Hindi digestion 
5 demonstrated cosegregation of the mutation and the phenotype in the family; all affected 
individuals were homozygous for the mutation, and all obligate carriers (unaffected 
offspring of affected individuals and unaffected parents of affected individuals) were 
heterozygous for the mutation (data not shown). None of a panel of 43 DNA samples 
from individuals of Pakistani origin carried the mutation. 

10 The data presented here demonstrate that a mutation in a novel gene in the 

sulfate activation pathway results in a human osteochondrodysplasia phenotype. The 
PAPSS2 mutation characterized in the Pakistani SEMD family introduced a nonsense 
mutation at the codon for residue 475 of the 615 amino acid protein. It is unknown 
whether, as has been observed for many other genes (e.g., Willing, M.C., et al 9 

15 Premature chain termination is a unifying mechanism for COL1A1 null alleles in 
osteogenesis imperfecta type I cell strains, Am. J. Hum. Genet. 59:799-809 [1996]), the 
nonsense mutation leads to destabilization of the transcript, or if a truncated protein, 
perhaps retaining partial activity, is synthesized. 

Even though the PAPSS2 gene is expressed in a wide variety of tissues 
20 (Figure 4), the mutation produces an exclusively skeletal phenotype. Thus, similar to the 
disproportionate effect on cartilage of mutations in the gene encoding the widely 
expressed sulfate transporter DTDST (Hastbacka, J., et al [1994]; Rossi, A, et al 
[1998]), the cartilage-specificity of phenotypic expression is likely the result of the high 
demand for sulfation of extracellular matrix molecules in cartilage. Differences in levels 
25 of expression of the ATPSK1 and PAPSS2 transcripts among different tissues 
(unpublished data) suggest that this may also influence the tissue-specificity of the 
phenotype. In cartilage, both PAPSS1 and PAPSS2 transcripts are expressed 
(unpublished data), but the relative abundances of the proteins in vivo, and their 
contributions to net PAPS synthesis, are unknown. The milder effect of a mutation in 
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PAPSS2 as compared with DTDST mutations implies that PAPSS1 partially compensates 
for the PAPSS2 defect by providing a physiologically significant level of PAPS synthesis 
in cartilage. 

Example 5: An Papss2 missense mutation in the brachymorphic mouse . 

Complete sequence analysis of the Papss2 coding region from the 
brachymorphic mouse and the PWK strain used in the backcross identified ten single 
nucleotide sequence differences between the strains [A at base 295 (A295), G342, G386, 
C649, G881, C1086, G1239, T1524, G1557, and T1815]. Of these, only two implied 
sequence differences at the protein level, predicting G79R and K109R substitutions, both 
within the APS kinase domain of the protein. As the normal mouse strain C57BL/6 was 
also homozygous for arginine at residue 109, the G79R was the likely pathogenic 
substitution. A glycine codon is present in the same position among APS kinase domains 
of eucaryotic Papss genes as well as among APS kinases of bacteria, suggesting it is 
essential for APS kinase activity. 

The conclusion that the G79R Papss2 substitution produces the 
brachymorphic phenotype is supported by the following evidence. First, the biochemical 
defect in PAPS synthetase activity measured in brachymorphic cartilage and liver tissue 
extracts, showing markedly decreased APS kinase activity and a lesser effect on ATP 
sulfurylase activity, predicts a structural defect affecting the kinase domain of the protein. 
As the G79R substitution is within the kinase domain, these data are compatible with the 
biochemical evidence and the hypothesis that the abnormal enzyme does not properly 
channel APS to the kinase. (Lyle, S., et a!., Sulfate-activating enzymes in normal and 
brachymorphic mice: evidence for a channeling defect, Biochemistry 34:940-45 [1995]). 
Second, the decrease in Papss activity in liver and cartilage (Schwartz, N.B., et al [1978]; 
Sugahara, K. & Schwartz, N.B. [1979]; Sugahara, K. & Schwartz, N.B., Defect in 3'- 
phosphoadenosine 5 f -phosphosulfate synthesis in brachymorphic mice. IL tissue 
distribution of the defect, Arch. Biochem. Biophys. 214:602-09 [1982]) is correlated with 
the human tissues in which Papssl is highly expressed (unpublished data). Third, the 
characterized mutation is the only structural mutation in the coding region that is unique 
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to the brachymorphic mouse. Fourth, the mutation results in substitution for a highly 
conserved residue among other Papss proteins and APS kinase domains of species from 
vertebrates to bacteria, suggesting it is essential for kinase activity. Fifth, the phenotype 
of the brachymorphic mouse and the human SEMD family are substantially similar. Both 
are recessively inherited dwarfing conditions that affect the spine and long bones. (Lane 
& Dickie [1968]; Ahmad, M., et al [1998]). Both phenotypes are evident at birth, but 
disproportion increases postnatally. Radiographic studies in SEMD Pakistani type 
(Ahmad, M., et al. [1998]) and histologic analysis of brachymorphic mice (Orkin et al. 
[1977]) demonstrate that both phenotypes are characterized by a primarily epiphyseal 
dysplasia with only mild metaphyseal abnormalities. Although brachymorphism is 
produced by a missense mutation and SEMD by a nonsense mutation, the phenotypic 
similarity argues that an Papss2 null allele is likely to produce a mouse 
osteochondrodysplasia phenotype very similar or equivalent to brachymorphic. 

The foregoing examples being illustrative but not an exhaustive 
description of the embodiments of the present invention, the following claims are 
presented. 
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